Geometric Deep Neural Network Using Rigid and Non-Rigid Transformations for Human Action Recognition

Rasha Friji, Hassen Drira, Faten Chaieb, Hamza Kchok, Sebastian Kurtek; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 12611-12620

Abstract


Deep Learning architectures, albeit successful in mostcomputer vision tasks, were designed for data with an un-derlying Euclidean structure, which is not usually fulfilledsince pre-processed data may lie on a non-linear space.In this paper, we propose a geometry aware deep learn-ing approach using rigid and non rigid transformation opti-mization for skeleton-based action recognition. Skeleton se-quences are first modeled as trajectories on Kendall's shapespace and then mapped to the linear tangent space. The re-sulting structured data are then fed to a deep learning archi-tecture, which includes a layer that optimizes over rigid andnon rigid transformations of the 3D skeletons, followed bya CNN-LSTM network. The assessment on two large scaleskeleton datasets, namely NTU-RGB+D and NTU-RGB+D120, has proven that the proposed approach outperformsexisting geometric deep learning methods and exceeds re-cently published approaches with respect to the majority of configurations.

Related Material


[pdf]
[bibtex]
@InProceedings{Friji_2021_ICCV, author = {Friji, Rasha and Drira, Hassen and Chaieb, Faten and Kchok, Hamza and Kurtek, Sebastian}, title = {Geometric Deep Neural Network Using Rigid and Non-Rigid Transformations for Human Action Recognition}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2021}, pages = {12611-12620} }