Spatio-temporal Human-Object Interactions for Action Recognition in Videos

Victor Escorcia, Juan Carlos Niebles; Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops, 2013, pp. 508-514

Abstract


We introduce a new method for representing the dynamics of human-object interactions in videos. Previous algorithms tend to focus on modeling the spatial relationships between objects and actors, but ignore the evolving nature of this relationship through time. Our algorithm captures the dynamic nature of human-object interactions by modeling how these patterns evolve with respect to time. Our experiments show that encoding such temporal evolution is crucial for correctly discriminating human actions that involve similar objects and spatial human-object relationships, but only differ on the temporal aspect of the interaction, e.g. answer phone and dial phone We validate our approach on two human activity datasets and show performance improvements over competing state-of-the-art representations.

Related Material


[pdf]
[bibtex]
@InProceedings{Escorcia_2013_ICCV_Workshops,
author = {Victor Escorcia and Juan Carlos Niebles},
title = {Spatio-temporal Human-Object Interactions for Action Recognition in Videos},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops},
month = {June},
year = {2013}
}