Fusing Spatiotemporal Features and Joints for 3D Action Recognition

Yu Zhu, Wenbin Chen, Guodong Guo; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2013, pp. 486-491

Abstract


We present a novel approach to 3D human action recognition based on a feature-level fusion of spatiotemporal features and skeleton joints. First, 3D interest points detection and local feature description are performed to extract spatiotemporal motion information. Then the frame difference and pairwise distances of skeleton joint positions are computed to characterize the spatial information of the joints in 3D space. These two features are complementary to each other. A fusion scheme is then proposed to combine them effectively based on the random forests method. The proposed approach is validated on three challenging 3D action datasets for human action recognition. Experimental results show that the proposed approach outperforms the state-of-the-art methods on all three datasets.

Related Material


[pdf]
[bibtex]
@InProceedings{Zhu_2013_CVPR_Workshops,
author = {Zhu, Yu and Chen, Wenbin and Guo, Guodong},
title = {Fusing Spatiotemporal Features and Joints for 3D Action Recognition},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
month = {June},
year = {2013}
}