Box in the Box: Joint 3D Layout and Object Reasoning from Single Images

Alexander G. Schwing, Sanja Fidler, Marc Pollefeys, Raquel Urtasun; The IEEE International Conference on Computer Vision (ICCV), 2013, pp. 353-360

Abstract


In this paper we propose an approach to jointly infer the room layout as well as the objects present in the scene. Towards this goal, we propose a branch and bound algorithm which is guaranteed to retrieve the global optimum of the joint problem. The main difficulty resides in taking into account occlusion in order to not over-count the evidence. We introduce a new decomposition method, which generalizes integral geometry to triangular shapes, and allows us to bound the different terms in constant time. We exploit both geometric cues and object detectors as image features and show large improvements in 2D and 3D object detection over state-of-the-art deformable part-based models.

Related Material


[pdf]
[bibtex]
@InProceedings{Schwing_2013_ICCV,
author = {Schwing, Alexander G. and Fidler, Sanja and Pollefeys, Marc and Urtasun, Raquel},
title = {Box in the Box: Joint 3D Layout and Object Reasoning from Single Images},
booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
month = {December},
year = {2013}
}