MOGeo: Beyond One-to-One Cross-View Object Geo-localization

Bo Lv, Qingwang Zhang, Le Wu, Yuanyuan Li, Yingying Zhu; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026, pp. 26422-26431

Abstract


Cross-View Object Geo-Localization (CVOGL) aims to locate an object of interest in a query image within a corresponding satellite image. Existing methods typically assume that the query image contains only a single object, which does not align with the complex, multi-object geo-localization requirements in real-world applications, making them unsuitable for practical scenarios. To bridge the gap between the realistic setting and existing task, we propose a new task, called Cross-View Multi-Object Geo-Localization (CVMOGL). To advance the CVMOGL task, we first construct a benchmark, CMLocation, which includes two datasets: CMLocation-V1 and CMLocation-V2. Furthermore, we propose a novel cross-view multi-object geo-localization method, MOGeo, and benchmark it against existing state-of-the-art methods. Extensive experiments are conducted under various application scenarios to validate the effectiveness of our method. The results demonstrate that cross-view object geo-localization in the more realistic setting remains a challenging problem, encouraging further research in this area. Our dataset and code will be released at \texttt https://github.com/LV-BO001/MOGeo .

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Lv_2026_CVPR, author = {Lv, Bo and Zhang, Qingwang and Wu, Le and Li, Yuanyuan and Zhu, Yingying}, title = {MOGeo: Beyond One-to-One Cross-View Object Geo-localization}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2026}, pages = {26422-26431} }