Enhancing 2D Representation via Adjacent Views for 3D Shape Retrieval

Cheng Xu, Zhaoqun Li, Qiang Qiu, Biao Leng, Jingfei Jiang; The IEEE International Conference on Computer Vision (ICCV), 2019, pp. 3732-3740


Multi-view shape descriptors obtained from various 2D images are commonly adopted in 3D shape retrieval. One major challenge is that significant shape information are discarded during 2D view rendering through projection. In this paper, we propose a convolutional neural network based method, CenterNet, to enhance each individual 2D view using its neighboring ones. By exploiting cross-view correlations, CenterNet learns how adjacent views can be maximally incorporated for an enhanced 2D representation to effectively describe shapes. We observe that a very small amount of, e.g., six, enhanced 2D views, are already sufficient for a panoramic shape description. Thus, by simply aggregating features from six enhanced 2D views, we arrive at a highly compact yet discriminative shape descriptor. The proposed shape descriptor significantly outperforms state-of-the-art 3D shape retrieval methods on the ModelNet and ShapeNetCore55 benchmarks, and also exhibits robustness against object occlusion.

Related Material

[pdf] [supp]
author = {Xu, Cheng and Li, Zhaoqun and Qiu, Qiang and Leng, Biao and Jiang, Jingfei},
title = {Enhancing 2D Representation via Adjacent Views for 3D Shape Retrieval},
booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
month = {October},
year = {2019}