Patch-Level Augmentation for Object Detection in Aerial Images

Hong, Sungeun; Kang, Sungil; Cho, Donghyeon

Sungeun Hong, Sungil Kang, Donghyeon Cho; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2019

Abstract

Object detection in specific views (e.g., top view, road view, and aerial view) suffers from a lack of datasets, which causes class imbalance and makes it difficult to cover hard examples. To handle these issues, we propose a hard chip mining method that balances the ratio of each class and generates hard examples that are efficient for model training. First, we generate multi-scale chips to train an object detector. Next, we extract object patches from the dataset to construct an object pool, and use those patches to augment the dataset. This augmentation lets us overcome the class imbalance problem. After that, we run inference with the trained detector on the augmented images and generate hard chips from misclassified regions. Finally, we train the final detector on both normal and hard chips. The proposed method achieves superior results on the VisDrone dataset both qualitatively and quantitatively. Our model also ranked 3rd in the VisDrone-DET2019 challenge (http://aiskyeye.com/).
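
The two central steps of the abstract, pasting object patches to balance classes and collecting misclassified detections as hard examples, can be illustrated with a minimal sketch. This is not the authors' implementation. The function names (paste_patches, find_hard_detections), the inverse-frequency sampling policy, the rule that pasted patches must not overlap existing boxes, and the IoU threshold of 0.5 are all assumptions made for illustration; the abstract does not specify any of them. Images are plain nested lists so the sketch needs only the standard library.

```python
import random
from collections import Counter


def iou(a, b):
    """Intersection-over-union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def class_weights(dataset_labels):
    """Assumed policy: sample each class inversely to its frequency."""
    counts = Counter(dataset_labels)
    return {cls: 1.0 / n for cls, n in counts.items()}


def paste_patches(image, boxes, labels, pool, n_paste, rng, dataset_labels,
                  max_tries=50):
    """Paste up to n_paste patches from the object pool into a copy of image.

    image: list of pixel rows; pool: dict mapping label -> list of patches,
    where each patch is also a list of pixel rows.
    """
    image = [row[:] for row in image]          # leave the caller's image intact
    boxes, labels = list(boxes), list(labels)
    height, width = len(image), len(image[0])
    weights = class_weights(dataset_labels)
    classes = [cls for cls in pool if pool[cls]]
    for _ in range(n_paste):
        # Rare classes receive larger weights, so they are pasted more often.
        cls = rng.choices(classes, weights=[weights.get(c, 1.0) for c in classes])[0]
        patch = rng.choice(pool[cls])
        ph, pw = len(patch), len(patch[0])
        if ph > height or pw > width:
            continue
        for _ in range(max_tries):
            x, y = rng.randint(0, width - pw), rng.randint(0, height - ph)
            new_box = (x, y, x + pw, y + ph)
            # Assumption: a pasted patch must not overlap any existing object.
            if all(iou(new_box, b) == 0.0 for b in boxes):
                for r in range(ph):
                    image[y + r][x:x + pw] = patch[r]
                boxes.append(new_box)
                labels.append(cls)
                break
    return image, boxes, labels


def find_hard_detections(detections, gt_boxes, gt_labels, iou_thr=0.5):
    """Return (box, label) detections with no same-class ground-truth match.

    Regions around these detections are candidates for hard chips.
    """
    hard = []
    for box, label in detections:
        matched = any(
            gt_label == label and iou(box, gt_box) >= iou_thr
            for gt_box, gt_label in zip(gt_boxes, gt_labels)
        )
        if not matched:
            hard.append((box, label))
    return hard
```

In this sketch, a detector would first be trained on images returned by paste_patches, then run on those augmented images; the detections returned by find_hard_detections mark where hard chips could be cropped for the final training round. A real pipeline would also need blending at patch borders and scale handling, which are omitted here.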

Related Material

[bibtex]

@InProceedings{Hong_2019_ICCV,
author = {Hong, Sungeun and Kang, Sungil and Cho, Donghyeon},
title = {Patch-Level Augmentation for Object Detection in Aerial Images},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
month = {Oct},
year = {2019}
}