Hand PointNet: 3D Hand Pose Estimation Using Point Sets

Liuhao Ge, Yujun Cai, Junwu Weng, Junsong Yuan; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 8417-8426

Abstract


Convolutional Neural Network (CNN) has shown promising results for 3D hand pose estimation in depth images. Different from existing CNN-based hand pose estimation methods that take either 2D images or 3D volumes as the input, our proposed Hand PointNet directly processes the 3D point cloud that models the visible surface of the hand for pose regression. Taking the normalized point cloud as the input, our proposed hand pose regression network is able to capture complex hand structures and accurately regress a low dimensional representation of the 3D hand pose. In order to further improve the accuracy of fingertips, we design a fingertip refinement network that directly takes the neighboring points of the estimated fingertip location as input to refine the fingertip location. Experiments on three challenging hand pose datasets show that our proposed method outperforms state-of-the-art methods.

Related Material


[pdf] [supp] [video]
[bibtex]
@InProceedings{Ge_2018_CVPR,
author = {Ge, Liuhao and Cai, Yujun and Weng, Junwu and Yuan, Junsong},
title = {Hand PointNet: 3D Hand Pose Estimation Using Point Sets},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2018}
}