TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection

Wei, Yunchao; Shen, Zhiqiang; Cheng, Bowen; Shi, Honghui; Xiong, Jinjun; Feng, Jiashi; Huang, Thomas

Yunchao Wei, Zhiqiang Shen, Bowen Cheng, Honghui Shi, Jinjun Xiong, Jiashi Feng, Thomas Huang; Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 434-450

Abstract

This work provides a simple approach to discover tight object bounding boxes with only image-level supervision, called Tight box mining with Surrounding Segmentation Context (TS2C). We observe that object candidates mined through current multiple instance learning methods are usually trapped to discriminative object parts, rather than the entire object. TS2C leverages surrounding segmentation context derived from weakly-supervised segmentation to suppress such low-quality distracting candidates and boost the high-quality ones. Specifically, TS2C is developed based on two key properties of desirable bounding boxes: 1) high purity, meaning most pixels in the box are with high object response, and 2) high completeness, meaning the box covers high object response pixels comprehensively. With such novel and computable criteria, more tight candidates can be discovered for learning a better object detector. With TS2C, we obtain 48.0% and 44.4% mAP scores on VOC 2007 and 2012 benchmarks, which are the new state-of-the-arts.

Related Material

[pdf] [arXiv]

[bibtex]

@InProceedings{Wei_2018_ECCV,
author = {Wei, Yunchao and Shen, Zhiqiang and Cheng, Bowen and Shi, Honghui and Xiong, Jinjun and Feng, Jiashi and Huang, Thomas},
title = {TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
month = {September},
year = {2018}
}