Augmenting Crowd-Sourced 3D Reconstructions Using Semantic Detections

True Price, Johannes L. Schönberger, Zhen Wei, Marc Pollefeys, Jan-Michael Frahm; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 1926-1935

Abstract


Image-based 3D reconstruction for Internet photo collections has become a robust technology to produce impressive virtual representations of real-world scenes. However, several fundamental challenges remain for Structure-from-Motion (SfM) pipelines, namely: the placement and reconstruction of transient objects only observed in single views, estimating the absolute scale of the scene, and (suprisingly often) recovering ground surfaces in the scene. We propose a method to jointly address these remaining open problems of SfM. In particular, we focus on detecting people in individual images and accurately placing them into an existing 3D model. As part of this placement, our method also estimates the absolute scale of the scene from object semantics, which in this case constitutes the height distribution of the population. Further, we obtain a smooth approximation of the ground surface and recover the gravity vector of the scene directly from the individual person detections. We demonstrate the results of our approach on a number of unordered Internet photo collections, and we quantitatively evaluate the obtained absolute scene scales.

Related Material


[pdf] [Supp]
[bibtex]
@InProceedings{Price_2018_CVPR,
author = {Price, True and Schönberger, Johannes L. and Wei, Zhen and Pollefeys, Marc and Frahm, Jan-Michael},
title = {Augmenting Crowd-Sourced 3D Reconstructions Using Semantic Detections},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2018}
}