Positive-Unlabeled Data Purification in the Wild for Object Detection

Guo, Jianyuan; Han, Kai; Wu, Han; Zhang, Chao; Chen, Xinghao; Xu, Chunjing; Xu, Chang; Wang, Yunhe

Jianyuan Guo, Kai Han, Han Wu, Chao Zhang, Xinghao Chen, Chunjing Xu, Chang Xu, Yunhe Wang; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 2653-2662

Abstract

Deep learning based object detection approaches have achieved great progress with the benefit from large amount of labeled images. However, image annotation remains a laborious, time-consuming and error-prone process. To further improve the performance of detectors, we seek to exploit all available labeled data and excavate useful samples from massive unlabeled images in the wild, which is rarely discussed before. In this paper, we present a positive-unlabeled learning based scheme to expand training data by purifying valuable images from massive unlabeled ones, where the original training data are viewed as positive data and the unlabeled images in the wild are unlabeled data. To effectively utilized these purified data, we propose a self-distillation algorithm based on hint learning and ground truth bounded knowledge distillation. Experimental results verify that the proposed positive-unlabeled data purification can strengthen the original detector by mining the massive unlabeled data. In particular, our method boosts the mAP of FPN by +2.0% on COCO benchmark.

Related Material

[pdf]

[bibtex]

@InProceedings{Guo_2021_CVPR, author = {Guo, Jianyuan and Han, Kai and Wu, Han and Zhang, Chao and Chen, Xinghao and Xu, Chunjing and Xu, Chang and Wang, Yunhe}, title = {Positive-Unlabeled Data Purification in the Wild for Object Detection}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2021}, pages = {2653-2662} }