Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition

Tang, Yansong; Tian, Yi; Lu, Jiwen; Li, Peiyang; Zhou, Jie

Yansong Tang, Yi Tian, Jiwen Lu, Peiyang Li, Jie Zhou; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 5323-5332

Abstract

In this paper, we propose a deep progressive reinforcement learning (DPRL) method for action recognition in skeleton-based videos, which aims to distil the most informative frames and discard ambiguous frames in sequences for recognizing actions. Since the choices of selecting representative frames are multitudinous for each video, we model the frame selection as a progressive process through deep reinforcement learning, during which we progressively adjust the chosen frames by taking two important factors into account: (1) the quality of the selected frames and (2) the relationship between the selected frames to the whole video. Moreover, considering the topology of human body inherently lies in a graph-based structure, where the vertices and edges represent the hinged joints and rigid bones respectively, we employ the graph-based convolutional neural network to capture the dependency between the joints for action recognition. Our approach achieves very competitive performance on three widely used benchmarks.

Related Material

[pdf]

[bibtex]

@InProceedings{Tang_2018_CVPR,
author = {Tang, Yansong and Tian, Yi and Lu, Jiwen and Li, Peiyang and Zhou, Jie},
title = {Deep Progressive Reinforcement Learning for Skeleton-Based Action Recognition},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2018}
}