Scaling Up Instance Annotation via Label Propagation

Dim P. Papadopoulos, Ethan Weber, Antonio Torralba; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 15364-15373


Manually annotating object segmentation masks is very time-consuming. While interactive segmentation methods offer a more efficient alternative, they become unaffordable at a large scale because the cost grows linearly with the number of annotated masks. In this paper, we propose a highly efficient annotation scheme for building large datasets with object segmentation masks. At a large scale, images contain many object instances with similar appearance. We exploit these similarities by using hierarchical clustering on mask predictions made by a segmentation model. We propose a scheme that efficiently searches through the hierarchy of clusters and selects which clusters to annotate. Humans manually verify only a few masks per cluster, and the labels are propagated to the whole cluster. Through a large-scale experiment to populate 1M unlabeled images with object segmentation masks for 80 object classes, we show that (1) we obtain 1M object segmentation masks with a total annotation time of only 290 hours; (2) we reduce annotation time by 76x compared to manual annotation; (3) the segmentation quality of our masks is on par with those from manually annotated datasets. Code, data, and models are available online.
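The verify-then-propagate idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Cluster` class, the `propagate_labels` function, the `verify` callback, and the `samples_per_cluster` and `min_accept` parameters are all hypothetical names chosen for illustration, and the acceptance rule (accept a cluster when the sampled masks pass, otherwise descend to its children) is an assumed simplification of the search over the cluster hierarchy.

```python
import random
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Set, Tuple


@dataclass
class Cluster:
    """Hypothetical node in a cluster hierarchy over predicted masks."""
    mask_ids: List[int]
    children: List["Cluster"] = field(default_factory=list)


def propagate_labels(
    root: Cluster,
    verify: Callable[[int], bool],
    samples_per_cluster: int = 3,
    min_accept: float = 1.0,
    rng: Optional[random.Random] = None,
) -> Tuple[Set[int], int]:
    """Top-down search: verify a few masks per cluster, propagate on success.

    `verify` stands in for a human annotator answering "is this mask correct?".
    Returns the set of accepted mask ids and the number of verifications used.
    """
    rng = rng or random.Random(0)
    accepted: Set[int] = set()
    n_verified = 0
    stack = [root]
    while stack:
        node = stack.pop()
        if not node.mask_ids:
            continue
        # Ask the annotator about only a small random sample of the cluster.
        k = min(samples_per_cluster, len(node.mask_ids))
        sample = rng.sample(node.mask_ids, k)
        votes = [verify(mask_id) for mask_id in sample]
        n_verified += k
        if sum(votes) / k >= min_accept:
            # Assumed rule: the sample passed, so label the whole cluster.
            accepted.update(node.mask_ids)
        else:
            # The cluster looks impure: descend to its finer sub-clusters.
            # A rejected leaf (no children) is simply discarded.
            stack.extend(node.children)
    return accepted, n_verified
```

In this sketch a pure cluster of any size costs only `samples_per_cluster` verifications, which is where the savings over per-mask annotation would come from. Sampling can also accept an impure cluster by chance, so a real system would need to trade off sample size against label noise.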

Related Material

@InProceedings{Papadopoulos_2021_ICCV,
    author    = {Papadopoulos, Dim P. and Weber, Ethan and Torralba, Antonio},
    title     = {Scaling Up Instance Annotation via Label Propagation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {15364-15373}
}