The Multi-Modal Video Reasoning and Analyzing Competition

Peng, Haoran; Huang, He; Xu, Li; Li, Tianjiao; Liu, Jun; Rahmani, Hossein; Ke, Qiuhong; Guo, Zhicheng; Wu, Cong; Li, Rongchang; Ye, Mang; Wang, Jiahao; Zhang, Jiaxu; Liu, Yuanzhong; He, Tao; Zhang, Fuwei; Liu, Xianbin; Lin, Tao

Haoran Peng, He Huang, Li Xu, Tianjiao Li, Jun Liu, Hossein Rahmani, Qiuhong Ke, Zhicheng Guo, Cong Wu, Rongchang Li, Mang Ye, Jiahao Wang, Jiaxu Zhang, Yuanzhong Liu, Tao He, Fuwei Zhang, Xianbin Liu, Tao Lin; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2021, pp. 806-813

Abstract

In this paper, we introduce the Multi-Modal Video Reasoning and Analyzing Competition (MMVRAC) workshop in conjunction with ICCV 2021. This competition is composed of four different tracks, namely, video question answering, skeleton-based action recognition, fisheye video-based action recognition, and person re-identification, which are based on two datasets: SUTD-TrafficQA and UAV-Human. We summarize the top performing methods submitted by the participants in this competition and show their results achieved in the competition.

Related Material

[pdf] [arXiv]

[bibtex]

@InProceedings{Peng_2021_ICCV, author = {Peng, Haoran and Huang, He and Xu, Li and Li, Tianjiao and Liu, Jun and Rahmani, Hossein and Ke, Qiuhong and Guo, Zhicheng and Wu, Cong and Li, Rongchang and Ye, Mang and Wang, Jiahao and Zhang, Jiaxu and Liu, Yuanzhong and He, Tao and Zhang, Fuwei and Liu, Xianbin and Lin, Tao}, title = {The Multi-Modal Video Reasoning and Analyzing Competition}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops}, month = {October}, year = {2021}, pages = {806-813} }