CDAD: A Common Daily Action Dataset With Collected Hard Negative Samples

Wangmeng Xiang, Chao Li, Ke Li, Biao Wang, Xian-sheng Hua, Lei Zhang; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022, pp. 3921-3930

Abstract


The research on action understanding has achieved significant progress with the establishment of various benchmark datasets. However, the results of action understanding are far from satisfactory in practice. One reason is that the existing action datasets ignore the existence of many hard negative samples in real-world scenarios, which are usually undefined confusion actions, e.g., holding a pen near the mouth vs. smoking. In this work, we focus on the common actions in our daily life and present a novel Common Daily Action Dataset (CDAD), which consists of 57,824 video clips of 23 well-defined common daily actions with rich manual annotations. Particularly, for each daily action, we collect not only diverse positive samples but also various hard negative samples that have minor differences (share similarities) in action with the positive ones. The established CDAD dataset could not only serve as a benchmark for several important daily action understanding tasks, including multi-label action recognition, temporal action localization, and spatial-temporal action detection but also provide a testbed for researchers to investigate the influence of highly similar negative samples in learning action understanding models. The established CDAD dataset will be released for research purposes.

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Xiang_2022_CVPR, author = {Xiang, Wangmeng and Li, Chao and Li, Ke and Wang, Biao and Hua, Xian-sheng and Zhang, Lei}, title = {CDAD: A Common Daily Action Dataset With Collected Hard Negative Samples}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2022}, pages = {3921-3930} }