Temporal Poselets for Collective Activity Detection and Recognition

Moin Nabi, Alessio Del Bue, Vittorio Murino; Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops, 2013, pp. 500-507

Abstract


Detection and recognition of collective human activities are important modules of any system devoted to high-level social behavior analysis. In this paper, we present a novel semantic-based spatio-temporal descriptor which can cope with several interacting people at different scales and multiple activities in a video. Our descriptor is suitable for modelling the human motion interaction in crowded environments the scenario most difficult to analyse because of occlusions. In particular, we extend the Poselet detector approach by defining a descriptor based on Poselet activation patterns over time, named TPOS. We will show that this descriptor can effectively tackle complex real scenarios allowing to detect humans in the scene, to localize (in space-time) human activities, and perform collective group activity recognition in a joint manner, reaching state-of-theart results.

Related Material


[pdf]
[bibtex]
@InProceedings{Nabi_2013_ICCV_Workshops,
author = {Moin Nabi and Alessio Del Bue and Vittorio Murino},
title = {Temporal Poselets for Collective Activity Detection and Recognition},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops},
month = {June},
year = {2013}
}