Learning to Combine Mid-Level Cues for Object Proposal Generation

Tom Lee, Sanja Fidler, Sven Dickinson; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2015, pp. 1680-1688

Abstract


In recent years, region proposals have replaced sliding windows in support of object recognition, offering more discriminating shape and appearance information through improved localization. One powerful approach for generating region proposals is based on minimizing parametric energy functions with parametric maxflow. In this paper, we introduce Parametric Min-Loss (PML), a novel structured learning framework for parametric energy functions. While PML is generally applicable to different domains, we use it in the context of region proposals to learn to combine a set of mid-level grouping cues to yield a small set of object region proposals with high recall. Our learning framework accounts for multiple diverse outputs, and is complemented by diversification seeds based on image location and color. This approach casts perceptual grouping and cue combination in a novel structured learning framework which yields baseline improvements on VOC 2012 and COCO 2014.

Related Material


[pdf]
[bibtex]
@InProceedings{Lee_2015_ICCV,
author = {Lee, Tom and Fidler, Sanja and Dickinson, Sven},
title = {Learning to Combine Mid-Level Cues for Object Proposal Generation},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {December},
year = {2015}
}