SoccerNet-Depth: a Scalable Dataset for Monocular Depth Estimation in Sports Videos

Arnaud Leduc, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 3280-3292

Abstract


Monocular Depth Estimation (MDE) is fundamental in sports video understanding enhancing augmented graphics scene understanding and game state reconstruction. Despite remarkable progress in autonomous driving and indoor scene understanding there is currently a lack of MDE datasets tailored for sports. Furthermore most existing datasets only focus on single images disregarding the temporal aspect. In this work we introduce the first video dataset for MDE in sports SoccerNet-Depth focusing on football and basketball videos. In particular we leverage the graphic engine from video games to automatically extract video sequences and their associated depth maps making our dataset easily scalable. Furthermore we benchmark and fine-tune several state-of-the-art MDE methods on our dataset. Our analysis shows that MDE in sports is far from being solved making our dataset a perfect playground for future research. Dataset and codes: https://github.com/SoccerNet/sn-depth.

Related Material


[pdf]
[bibtex]
@InProceedings{Leduc_2024_CVPR, author = {Leduc, Arnaud and Cioppa, Anthony and Giancola, Silvio and Ghanem, Bernard and Van Droogenbroeck, Marc}, title = {SoccerNet-Depth: a Scalable Dataset for Monocular Depth Estimation in Sports Videos}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2024}, pages = {3280-3292} }