Hockey Action Recognition via Integrated Stacked Hourglass Network

Mehrnaz Fani, Helmut Neher, David A. Clausi, Alexander Wong, John Zelek; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2017, pp. 29-37

Abstract


A convolutional neural network (CNN) has been designed to interpret player actions in ice hockey video. The hourglass network is employed as the base to generate player pose estimation and layers are added to this network to produce action recognition. As such, the unified architecture is referred to as action recognition hourglass network, or ARHN. ARHN has three components. The first component is the latent pose estimator, the second transforms latent features to a common frame of reference, and the third performs action recognition. Since no benchmark dataset for pose estimation or action recognition is available for hockey players, we generate such an annotated dataset. Experimental results show action recognition accuracy of 65% for four types of actions in hockey. When similar poses are merged to three and two classes, the accuracy rate increases to 71% and 78%, proving the efficacy of the methodology for automated action recognition in hockey.

Related Material


[pdf]
[bibtex]
@InProceedings{Fani_2017_CVPR_Workshops,
author = {Fani, Mehrnaz and Neher, Helmut and Clausi, David A. and Wong, Alexander and Zelek, John},
title = {Hockey Action Recognition via Integrated Stacked Hourglass Network},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
month = {July},
year = {2017}
}