Robust Video Segment Proposals With Painless Occlusion Handling

Zhengyang Wu, Fuxin Li, Rahul Sukthankar, James M. Rehg; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 4194-4203

Abstract


We propose a robust algorithm to generate video segment proposals. The proposals generated by our method can start from any frame in the video and are robust to complete occlusions. Our method does not assume specific motion models and even has a limited capability to generalize across videos. We build on our previous least squares tracking framework, where image segment proposals are generated and tracked using learned appearance models. The innovation in our new method lies in the use of two efficient moves, the merge move and free addition, to efficiently start segments from any frame and track them through complete occlusions, without much additional computation. Segment size interpolation is used for effectively detecting occlusions. We propose a new metric for evaluating video segment proposals on the challenging VSB-100 benchmark and present state-of-the-art results. Preliminary results are also shown for the potential use of our framework to track segments across different videos.

Related Material


[pdf]
[bibtex]
@InProceedings{Wu_2015_CVPR,
author = {Wu, Zhengyang and Li, Fuxin and Sukthankar, Rahul and Rehg, James M.},
title = {Robust Video Segment Proposals With Painless Occlusion Handling},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2015}
}