Refining Joint Locations for Human Pose Tracking in Sports Videos

Dan Zecha, Moritz Einfalt, Rainer Lienhart; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2019, pp. 0-0

Abstract


The estimation of an athlete's pose in video footage enables the automation of athletic performance assessment, the prediction of motion kinematics and dynamics in sports videos and the possibility of technology-assisted, direct training feedback. Despite remarkable progress in the field of deep learning assisted human pose estimation, the performance of such systems decreases while noise and errors increase with the complexity of the scene. In this paper, we focus on aquatic training scenarios, where even novel pose estimators produce several types of orthogonal errors, including joint swaps and prediction outliers. In order to improve the estimation of an athlete's pose in swimming, we propose a graph partitioning problem that connects pose estimates over time and explicitly allows for joints to switch labels if their location better fits each other's trajectory. We optimize the problem using integer linear programming, which partitions the graph into the most probable joint trajectories. We show experimentally that our method of joint rectification improves the joint detection precision of swimmers in a swimming channel by 0.8%-4.8% PCK for anti-symmetrical motion and up to 1.8% PCK for symmetrical styles.

Related Material


[pdf]
[bibtex]
@InProceedings{Zecha_2019_CVPR_Workshops,
author = {Zecha, Dan and Einfalt, Moritz and Lienhart, Rainer},
title = {Refining Joint Locations for Human Pose Tracking in Sports Videos},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
month = {June},
year = {2019}
}