Building Part-Based Object Detectors via 3D Geometry

Abhinav Shrivastava, Abhinav Gupta; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2013, pp. 1745-1752

Abstract


This paper proposes a novel part-based representation for modeling object categories. Our representation combines the effectiveness of deformable part-based models with the richness of geometric representation by defining parts based on consistent underlying 3D geometry. Our key hypothesis is that while the appearance and the arrangement of parts might vary across the instances of object categories, the constituent parts will still have consistent underlying 3D geometry. We propose to learn this geometrydriven deformable part-based model (gDPM) from a set of labeled RGBD images. We also demonstrate how the geometric representation of gDPM can help us leverage depth data during training and constrain the latent model learning problem. But most importantly, a joint geometric and appearance based representation not only allows us to achieve state-of-the-art results on object detection but also allows us to tackle the grand challenge of understanding 3D objects from 2D images.

Related Material


[pdf]
[bibtex]
@InProceedings{Shrivastava_2013_ICCV,
author = {Shrivastava, Abhinav and Gupta, Abhinav},
title = {Building Part-Based Object Detectors via 3D Geometry},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {December},
year = {2013}
}