Efficient Hand Pose Estimation from a Single Depth Image

Chi Xu, Li Cheng; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2013, pp. 3456-3462

Abstract


We tackle the practical problem of hand pose estimation from a single noisy depth image. A dedicated three-step pipeline is proposed: Initial estimation step provides an initial estimation of the hand in-plane orientation and 3D location; Candidate generation step produces a set of 3D pose candidate from the Hough voting space with the help of the rotational invariant depth features; Verification step delivers the final 3D hand pose as the solution to an optimization problem. We analyze the depth noises, and suggest tips to minimize their negative impacts on the overall performance. Our approach is able to work with Kinecttype noisy depth images, and reliably produces pose estimations of general motions efficiently (12 frames per second). Extensive experiments are conducted to qualitatively and quantitatively evaluate the performance with respect to the state-of-the-art methods that have access to additional RGB images. Our approach is shown to deliver on par or even better results.

Related Material


[pdf]
[bibtex]
@InProceedings{Xu_2013_ICCV,
author = {Xu, Chi and Cheng, Li},
title = {Efficient Hand Pose Estimation from a Single Depth Image},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {December},
year = {2013}
}