Evaluating the Immediate Applicability of Pose Estimation for Sign Language Recognition

Amit Moryossef, Ioannis Tsochantaridis, Joe Dinn, Necati Cihan Camgoz, Richard Bowden, Tao Jiang, Annette Rios, Mathias Muller, Sarah Ebling; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2021, pp. 3434-3440

Abstract


Signed languages are visual languages produced by the movement of the hands, face, and body. In this paper, we evaluate representations based on skeleton poses, as these are explainable, person-independent, privacy-preserving, low-dimensional representations. Basically, skeletal representations generalize over an individual's appearance and background, allowing us to focus on the recognition of motion. But how much information is lost by the skeletal representation? We perform two independent studies using two state-of-the-art pose estimation systems. We analyze the applicability of the pose estimation systems to sign language recognition by evaluating the failure cases of the recognition models. Importantly, this allows us to characterize the current limitations of skeletal pose estimation approaches in sign language recognition.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Moryossef_2021_CVPR, author = {Moryossef, Amit and Tsochantaridis, Ioannis and Dinn, Joe and Camgoz, Necati Cihan and Bowden, Richard and Jiang, Tao and Rios, Annette and Muller, Mathias and Ebling, Sarah}, title = {Evaluating the Immediate Applicability of Pose Estimation for Sign Language Recognition}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2021}, pages = {3434-3440} }