Improving Object Detection With Deep Convolutional Networks via Bayesian Optimization and Structured Prediction

Yuting Zhang, Kihyuk Sohn, Ruben Villegas, Gang Pan, Honglak Lee; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 249-258

Abstract


Object detection systems based on the deep convolutional neural network (CNN) have recently made groundbreaking advances on several object detection benchmarks. While the features learned by these high-capacity neural networks are discriminative for categorization, inaccurate localization is still a major source of error for detection. Building upon high-capacity CNN architectures, we address the localization problem by 1) using a search algorithm based on Bayesian optimization that sequentially proposes candidate regions for an object bounding box, and 2) training the CNN with a structured loss that explicitly penalizes the localization inaccuracy. In experiments, we demonstrate that each of the proposed methods improves the detection performance over the baseline method on the PASCAL VOC 2007 and 2012 datasets. Furthermore, the two methods are complementary and significantly outperform the previous state-of-the-art when combined.
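As a rough illustration of what a structured loss that penalizes localization inaccuracy can look like, the sketch below scores a candidate box by one minus its intersection-over-union (IoU) with the ground truth and plugs that into a margin-rescaled hinge. The abstract does not specify the loss, so the `1 - IoU` form, the box format `(x1, y1, x2, y2)`, and the names `iou`, `localization_loss`, `structured_hinge`, and `score_fn` are all assumptions made for this example, not the authors' implementation.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap region (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def localization_loss(pred_box, gt_box):
    """Assumed task loss: 0 for a perfect box, 1 for no overlap."""
    return 1.0 - iou(pred_box, gt_box)


def structured_hinge(score_fn, candidates, gt_box):
    """Margin-rescaled structured hinge over a finite candidate set.

    score_fn is a hypothetical stand-in for the detector's scoring
    function; it maps a box to a real-valued score.
    """
    gt_score = score_fn(gt_box)
    # Most violating candidate: high score despite poor localization.
    worst = max(localization_loss(c, gt_box) + score_fn(c) for c in candidates)
    return max(0.0, worst - gt_score)
```

With this form, a poorly localized candidate must score lower than the ground-truth box by at least its localization loss, otherwise the hinge is positive and contributes to the training objective.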

Related Material


[bibtex]
@InProceedings{Zhang_2015_CVPR,
author = {Zhang, Yuting and Sohn, Kihyuk and Villegas, Ruben and Pan, Gang and Lee, Honglak},
title = {Improving Object Detection With Deep Convolutional Networks via Bayesian Optimization and Structured Prediction},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2015},
pages = {249-258}
}