RepPoints: Point Set Representation for Object Detection

Ze Yang, Shaohui Liu, Han Hu, Liwei Wang, Stephen Lin; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, pp. 9657-9666


Modern object detectors rely heavily on rectangular bounding boxes, such as anchors, proposals and the final predictions, to represent objects at various recognition stages. The bounding box is convenient to use but provides only a coarse localization of objects and leads to a correspondingly coarse extraction of object features. In this paper, we present RepPoints (representative points), a new finer representation of objects as a set of sample points useful for both localization and recognition. Given ground truth localization and recognition targets for training, RepPoints learn to automatically arrange themselves in a manner that bounds the spatial extent of an object and indicates semantically significant local areas. They furthermore do not require the use of anchors to sample a space of bounding boxes. We show that an anchor-free object detector based on RepPoints can be as effective as the state-of-the-art anchor-based detection methods, with 46.5 AP and 67.4 AP_ 50 on the COCO test-dev detection benchmark, using ResNet-101 model. Code is available at \color cyan .

Related Material

[pdf] [supp]
author = {Yang, Ze and Liu, Shaohui and Hu, Han and Wang, Liwei and Lin, Stephen},
title = {RepPoints: Point Set Representation for Object Detection},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2019}