SemiHand: Semi-Supervised Hand Pose Estimation With Consistency

Linlin Yang, Shicheng Chen, Angela Yao; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 11364-11373

Abstract


We present SemiHand, a semi-supervised framework for 3D hand pose estimation from monocular images. We pre-train the model on labelled synthetic data and fine-tune it on unlabelled real-world data by pseudo-labeling with consistency training. By design, we introduce data augmentation of differing difficulties, consistency regularizer, label correction and sample selection for RGB-based 3D hand pose estimation. In particular, by approximating the hand masks from hand poses, we propose a cross-modal consistency and leverage semantic predictions to guide the predicted poses. Meanwhile, we introduce pose registration as label correction to guarantee the biomechanical feasibility of hand bone lengths. Experiments show that our method achieves a favorable improvement on real-world datasets after fine-tuning.

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Yang_2021_ICCV, author = {Yang, Linlin and Chen, Shicheng and Yao, Angela}, title = {SemiHand: Semi-Supervised Hand Pose Estimation With Consistency}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2021}, pages = {11364-11373} }