StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipation

Francesco Ragusa, Giovanni Maria Farinella, Antonino Furnari; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2023, pp. 3636-3645

Abstract


Anticipation problem has been studied considering different aspects such as predicting humans' locations, predicting hands and objects trajectories, and forecasting actions and human-object interactions. In this paper, we studied the short-term object interaction anticipation problem from the egocentric point of view, proposing a new end-to-end architecture named StillFast. Our approach simultaneously processes a still image and a video detecting and localizing next-active objects, predicting the verb which describes the future interaction and determining when the interaction will start. Experiments on the large-scale egocentric dataset EGO4D show that our method outperformed state-of-the-art approaches on the considered task. Our method is ranked first in the public leaderboard of the EGO4D short term object interaction anticipation challenge 2022 and it is the official baseline for the 2023 one. Please see the project web page for code and additional details: https://iplab.dmi.unict.it/stillfast/.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Ragusa_2023_CVPR, author = {Ragusa, Francesco and Farinella, Giovanni Maria and Furnari, Antonino}, title = {StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipation}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2023}, pages = {3636-3645} }