Video Anomaly Detection via Sequentially Learning Multiple Pretext Tasks

Chenrui Shi, Che Sun, Yuwei Wu, Yunde Jia; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 10330-10340

Abstract


Learning multiple pretext tasks is a popular approach to tackling the nonalignment problem in unsupervised video anomaly detection. However, the conventional approach of learning multiple pretext tasks simultaneously is prone to sub-optimal solutions, incurring sharp performance drops. In this paper, we propose to learn multiple pretext tasks sequentially, in ascending order of difficulty, to improve anomaly detection performance. The core idea is to relax the learning objective by starting with easy pretext tasks in the early stage and to gradually refine it by involving more challenging pretext tasks later on. In this way, our method reduces the difficulty of learning and avoids converging to sub-optimal solutions. Specifically, we design a tailored sequential learning order for three widely used pretext tasks: it starts with the frame prediction task, moves on to the frame reconstruction task, and ends with the frame-order classification task. We further introduce a new contrastive loss that makes the learned representations of normality more discriminative by pushing normal and pseudo-abnormal samples apart. Extensive experiments on three datasets demonstrate the effectiveness of our method.
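
The task ordering described in the abstract can be illustrated with a minimal scheduling sketch. This is not the authors' implementation: the function names, the epoch boundaries in `stage_starts`, and the choice to keep earlier tasks active once later ones join (a cumulative schedule) are all assumptions made for illustration. Only the order of the three tasks comes from the abstract.

```python
# Task order taken from the abstract, easiest first.
TASK_ORDER = (
    "frame_prediction",
    "frame_reconstruction",
    "frame_order_classification",
)


def active_tasks(epoch, stage_starts=(0, 10, 20)):
    """Return the pretext tasks trained at `epoch`.

    stage_starts[i] is a hypothetical first epoch at which TASK_ORDER[i]
    joins training. Earlier tasks stay active afterwards; this cumulative
    behaviour is an assumption, not something the abstract specifies.
    """
    if len(stage_starts) != len(TASK_ORDER):
        raise ValueError("need one start epoch per task")
    if list(stage_starts) != sorted(stage_starts):
        raise ValueError("start epochs must be non-decreasing")
    return [task for task, start in zip(TASK_ORDER, stage_starts) if epoch >= start]


def total_loss(task_losses, epoch, stage_starts=(0, 10, 20)):
    """Sum the per-task losses of the tasks active at `epoch`.

    `task_losses` maps each task name to its already-computed loss value.
    """
    return sum(task_losses[task] for task in active_tasks(epoch, stage_starts))
```

With the default boundaries, `active_tasks(5)` contains only frame prediction, and all three tasks are active from epoch 20 onward. In a real training loop the per-task losses would come from the model's task heads, and the boundaries would be tuned rather than fixed.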

Related Material


[bibtex]
@InProceedings{Shi_2023_ICCV,
    author    = {Shi, Chenrui and Sun, Che and Wu, Yuwei and Jia, Yunde},
    title     = {Video Anomaly Detection via Sequentially Learning Multiple Pretext Tasks},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2023},
    pages     = {10330-10340}
}