DG-Recon: Depth-Guided Neural 3D Scene Reconstruction

Jihong Ju, Ching Wei Tseng, Oleksandr Bailo, Georgi Dikov, Mohsen Ghafoorian; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 18184-18194

Abstract


A key challenge in neural 3D scene reconstruction from monocular images is to fuse features back-projected from various views without any depth or occlusion information. We address this by leveraging monocular depth priors, which effectively guide the fusion to improve surface prediction and skip over irrelevant, ambiguous, or occluded features. Furthermore, we revisit the average-based fusion used by most neural 3D reconstruction methods and propose two alternatives, a variance-based and a cross-attention-based fusion module, that are more efficient and effective than their average-based and self-attention-based counterparts. Compared to the NeuralRecon baseline, the proposed DG-Recon models significantly improve reconstruction quality and completeness while remaining real-time. Our method achieves state-of-the-art online reconstruction results on the ScanNet dataset and is on par with the current best offline method, which repeatedly accesses keyframes from the entire video sequence. Our ScanNet-trained model also generalizes robustly to the challenging 7-Scenes dataset and a subset of SUN3D containing scenes as large as an entire floor.
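To make the two ideas in the abstract concrete, below is a minimal sketch (not the authors' code) of depth-guided masking of back-projected voxel features combined with variance-based fusion across views. All tensor shapes, the function name, and the `margin` tolerance are assumptions made for illustration; the actual DG-Recon modules may differ.

```python
import torch


def depth_guided_variance_fusion(
    feats: torch.Tensor,        # (V, N, C) features back-projected from V views onto N voxels
    voxel_depth: torch.Tensor,  # (V, N) depth of each voxel along each camera ray
    mono_depth: torch.Tensor,   # (V, N) monocular depth prior sampled at each voxel's pixel
    margin: float = 0.1,        # assumed tolerance band (meters) around the predicted surface
) -> torch.Tensor:
    # Keep a view's contribution only if the voxel lies near the surface
    # predicted by the monocular depth prior: voxels far behind it are
    # likely occluded, voxels far in front are likely empty space.
    visible = (voxel_depth - mono_depth).abs() < margin        # (V, N)
    mask = visible.unsqueeze(-1).float()                       # (V, N, 1)

    # Masked mean over views (clamp avoids division by zero for voxels
    # that no view observes near the surface).
    count = mask.sum(dim=0).clamp(min=1.0)                     # (N, 1)
    mean = (feats * mask).sum(dim=0) / count                   # (N, C)

    # Masked variance over views: high variance flags inconsistent
    # observations across views.
    var = ((feats - mean) ** 2 * mask).sum(dim=0) / count      # (N, C)

    # Concatenate the fusion statistics; a downstream 3D network would
    # consume these to predict the surface (e.g., TSDF values).
    return torch.cat([mean, var], dim=-1)                      # (N, 2C)


# Toy usage: 4 views, 1000 voxels, 32-channel features.
if __name__ == "__main__":
    V, N, C = 4, 1000, 32
    fused = depth_guided_variance_fusion(
        torch.randn(V, N, C), torch.rand(V, N) * 3.0, torch.rand(V, N) * 3.0
    )
    print(fused.shape)  # torch.Size([1000, 64])
```

The variance channel is what distinguishes this from plain average-based fusion: averaging alone discards how much the views disagree, whereas the per-voxel variance gives the downstream network an explicit consistency signal at negligible extra cost.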

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Ju_2023_ICCV,
    author    = {Ju, Jihong and Tseng, Ching Wei and Bailo, Oleksandr and Dikov, Georgi and Ghafoorian, Mohsen},
    title     = {DG-Recon: Depth-Guided Neural 3D Scene Reconstruction},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2023},
    pages     = {18184-18194}
}