-
[pdf]
[bibtex]@InProceedings{Biradar_2024_ACCV, author = {Biradar, Kuldeep and Tyagi, Dinesh Kumar and Battula, Ramesh Babu and Subbarao, P}, title = {Robust Anomaly Detection through Transformer-Encoded Feature Diversity Learning}, booktitle = {Proceedings of the Asian Conference on Computer Vision (ACCV) Workshops}, month = {December}, year = {2024}, pages = {115-128} }
Robust Anomaly Detection through Transformer-Encoded Feature Diversity Learning
Abstract
Detecting irregularities in weather data serves multiple practical applications. For example, nowcasting focuses on predicting atmospheric conditions for the next 0 to 4 hours, which is essential for effective emergency response and disaster management. Anomaly detection also plays a crucial role in forecasting extreme weather events. However, the complexity increases when considering both anomalies and varying weather conditions. A common approach to anomaly detection is weakly supervised video-level labeling, which aims to identify frames containing abnormal events and is typically framed as a multiple instance learning (MIL) problem. While existing methods perform well, the prevalence of negative instances significantly hinders their ability to detect positive instances, particularly rare abnormal segments. To address this, we aim to extract distinctive features by enhancing the observable differences between various classes using a single branch. We propose a novel method, Transformer Encoded Feature Video Anomaly Detection (TEF-VAD), which exclusively utilizes attention mechanisms, specifically Multi-Head Attention Learning. This approach combines feature magnitude learning loss, class-specific loss, and a TEF-VAD-enhanced MIL classifier training loss, thereby training a model to effectively identify positive examples and improve the MIL method's robustness for detecting positive instances in abnormal videos. Our extensive experiments demonstrate that the MIL model enhanced by our Transformer method significantly improves sample efficiency and the detection of subtle anomalies, outperforming several state-of-the-art techniques on benchmark datasets like UCF-Crime and ShanghaiTech.
Related Material