Primary Video Object Segmentation via Complementary CNNs and Neighborhood Reversible Flow

Jia Li, Anlin Zheng, Xiaowu Chen, Bin Zhou; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 1417-1425

Abstract

This paper proposes a novel approach for segmenting primary video objects by using Complementary Convolutional Neural Networks (CCNN) and neighborhood reversible flow. The proposed approach first pre-trains the CCNN end-to-end on a large collection of images with manually annotated salient objects. The trained CCNN has two separate branches that simultaneously handle two complementary tasks, i.e., foregroundness and backgroundness estimation. By applying the CCNN to each video frame, spatial foregroundness and backgroundness maps are initialized and then propagated across frames so as to segment primary video objects and suppress distractors. To enable efficient temporal propagation, we divide each frame into superpixels and construct a neighborhood reversible flow that reflects the most reliable temporal correspondences between superpixels, even in far-apart frames. Within this flow, the initialized foregroundness and backgroundness can be efficiently and accurately propagated along the temporal axis, so that primary video objects gradually pop out and distractors are well suppressed. Extensive experiments on three video datasets show that the proposed approach achieves impressive performance in comparison with 18 state-of-the-art models.

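As a rough illustration of the neighborhood reversible flow idea described in the abstract, the Python sketch below matches superpixels across two frames only when each is among the other's k nearest neighbors in feature space, and then propagates per-superpixel foregroundness scores over those mutual matches. The superpixel descriptor (mean color is assumed here), the value of k, and the uniform averaging step are illustrative assumptions, not the authors' exact formulation.

import numpy as np

def mutual_knn_pairs(feat_a, feat_b, k=5):
    # feat_a: (Na, D) descriptors of superpixels in frame a,
    # feat_b: (Nb, D) descriptors of superpixels in frame b,
    # e.g., mean color per superpixel (an assumption, not the paper's features).
    d = np.linalg.norm(feat_a[:, None, :] - feat_b[None, :, :], axis=-1)
    nn_a = np.argsort(d, axis=1)[:, :k]    # k nearest b-superpixels per a-superpixel
    nn_b = np.argsort(d.T, axis=1)[:, :k]  # k nearest a-superpixels per b-superpixel
    pairs = []
    for i in range(feat_a.shape[0]):
        for j in nn_a[i]:
            if i in nn_b[j]:               # neighborhood reversibility check
                pairs.append((i, int(j)))
    return pairs

def propagate_scores(scores_b, pairs, n_a):
    # Average the foregroundness of all reversibly matched superpixels
    # in frame b onto each superpixel of frame a; unmatched ones keep 0.
    acc = np.zeros(n_a)
    cnt = np.zeros(n_a)
    for i, j in pairs:
        acc[i] += scores_b[j]
        cnt[i] += 1.0
    return np.divide(acc, cnt, out=np.zeros(n_a), where=cnt > 0)

# Toy usage: 120 and 130 superpixels with 3-D color descriptors.
feat_t  = np.random.rand(120, 3)
feat_t1 = np.random.rand(130, 3)
pairs = mutual_knn_pairs(feat_t, feat_t1, k=5)
fg_t  = propagate_scores(np.random.rand(130), pairs, feat_t.shape[0])

In the full approach, such mutual matches would link each frame to several neighboring and more distant frames, so that the CCNN-initialized foregroundness and backgroundness maps can be refined along the temporal axis.
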
Related Material

[pdf]
[bibtex]
@InProceedings{Li_2017_ICCV,
  author    = {Li, Jia and Zheng, Anlin and Chen, Xiaowu and Zhou, Bin},
  title     = {Primary Video Object Segmentation via Complementary CNNs and Neighborhood Reversible Flow},
  booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
  month     = {Oct},
  year      = {2017},
  pages     = {1417-1425}
}