Cataloging Public Objects Using Aerial and Street-Level Images - Urban Trees

Jan D. Wegner, Steven Branson, David Hall, Konrad Schindler, Pietro Perona; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 6014-6023

Abstract


Each corner of the inhabited world is imaged from multiple viewpoints with increasing frequency. Online map services like Google Maps or Here Maps provide direct access to huge amounts of densely sampled, georeferenced images from street view and aerial perspective. There is an opportunity to design computer vision systems that will help us search, catalog and monitor public infrastructure, buildings and artifacts. We explore the architecture and feasibility of such a system. The main technical challenge is combining test time information from multiple views of each geographic location (e.g., aerial and street views). We implement two modules: det2geo, which detects the set of loca- tions of objects belonging to a given category, and geo2cat, which computes the fine-grained category of the object at a given location. We introduce a solution that adapts state-of-the-art CNN-based object detectors and classifiers. We test our method on "Pasadena Urban Trees", a new dataset of 80,000 trees with geographic and species annotations, and show that combining multiple views significantly improves both tree detection and tree species classification, rivaling human performance.

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Wegner_2016_CVPR,
author = {Wegner, Jan D. and Branson, Steven and Hall, David and Schindler, Konrad and Perona, Pietro},
title = {Cataloging Public Objects Using Aerial and Street-Level Images - Urban Trees},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2016}
}