An Empirical Study of Detection-Based Video Instance Segmentation

Qiang Wang, Yi He, Xiaoyun Yang, Zhao Yang, Philip Torr; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, pp. 0-0

Abstract


Video instance segmentation (VIS) is a composite task that requires the joint detection, tracking, and segmentation of objects in a video. In this work, we introduce a complete framework for VIS, which integrates the strengths of instance segmentation and general object tracking in addressing the unique challenges of VIS. In developing the framework, we investigate effective ways of coordinating the two components for maximum benefits while thoroughly investigate their separate contributions. Our approach improves over the official baseline by an absolute 14.4% in mAP and achieves the second place in the 2019 YouTubeVIS challenge.

Related Material


[pdf]
[bibtex]
@InProceedings{Wang_2019_ICCV,
author = {Wang, Qiang and He, Yi and Yang, Xiaoyun and Yang, Zhao and Torr, Philip},
title = {An Empirical Study of Detection-Based Video Instance Segmentation},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
month = {Oct},
year = {2019}
}