Object Discovery in Videos as Foreground Motion Clustering

Christopher Xie, Yu Xiang, Zaid Harchaoui, Dieter Fox; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 9994-10003

Abstract


We consider the problem of providing dense segmentation masks for object discovery in videos. We formulate the object discovery problem as foreground motion clustering, where the goal is to cluster foreground pixels in videos into different objects. We introduce a novel pixel-trajectory recurrent neural network that learns feature embeddings of foreground pixel trajectories linked across time. By clustering the pixel trajectories using the learned feature embeddings, our method establishes correspondences between foreground object masks across video frames. To demonstrate the effectiveness of our framework for object discovery, we conduct experiments on commonly used datasets for motion segmentation, where we achieve state-of-the-art performance.

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Xie_2019_CVPR,
author = {Xie, Christopher and Xiang, Yu and Harchaoui, Zaid and Fox, Dieter},
title = {Object Discovery in Videos as Foreground Motion Clustering},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2019}
}