Teeth Reconstruction and Performance Capture Using a Phone Camera

Weixi Zheng, Jingwang Ling, Zhibo Wang, Quan Wang, Feng Xu; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 9998-10008

Abstract


We present the first method for personalized dental shape reconstruction and teeth-inclusive facial performance capture using only a single phone camera. Our method democratizes high-quality facial avatars through a non-invasive, low-cost setup, addressing the ill-posed monocular capture problem with an analysis-by-synthesis approach. We introduce a representation adaptation technique that maintains both mesh and SDF representations of teeth, enabling efficient differentiable rendering while preventing teeth-lip interpenetration. To overcome the difficulty of aligning dental components that look alike, we leverage foundation models for semantic teeth segmentation and design specialized optimization objectives. We handle the frequent occlusion of teeth during facial performance through optimization strategies that leverage facial structural priors. Our semantic mask rendering loss, which uses optimal transport-based matching, ensures convergence despite significant variations in initial positioning.
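The abstract does not spell out how the optimal transport-based matching is formulated. As a rough illustration of why such a matching can help when rendered and target masks start far apart, the sketch below computes an entropy-regularized transport cost (log-domain Sinkhorn iterations) between two 2D point sets that stand in for the pixel coordinates of a rendered tooth mask and a segmented tooth mask. The function name `sinkhorn_matching_loss`, the squared-distance cost, the uniform point weights, and the values of `eps` and `n_iters` are all assumptions made for this example, not details taken from the paper.

```python
import numpy as np


def _logsumexp(a, axis):
    """Numerically stable log(sum(exp(a))) along one axis."""
    m = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(a - m), axis=axis))


def sinkhorn_matching_loss(src, dst, eps=0.05, n_iters=200):
    """Entropy-regularized OT cost between two point sets (hypothetical helper).

    src: (n, d) array, e.g. pixel coordinates of a rendered mask.
    dst: (m, d) array, e.g. pixel coordinates of a target mask.
    Returns (transport cost, soft matching plan of shape (n, m)).
    """
    n, m = len(src), len(dst)
    # Pairwise squared Euclidean distances (assumed cost for this sketch).
    cost = np.sum((src[:, None, :] - dst[None, :, :]) ** 2, axis=-1)
    # Uniform weights on both point sets, stored as logs.
    log_a = np.full(n, -np.log(n))
    log_b = np.full(m, -np.log(m))
    f = np.zeros(n)  # dual potential for src
    g = np.zeros(m)  # dual potential for dst
    for _ in range(n_iters):
        # Alternately rescale so that row, then column, marginals match.
        f = eps * (log_a - _logsumexp((g[None, :] - cost) / eps, axis=1))
        g = eps * (log_b - _logsumexp((f[:, None] - cost) / eps, axis=0))
    plan = np.exp((f[:, None] + g[None, :] - cost) / eps)
    return float(np.sum(plan * cost)), plan


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    target = rng.random((20, 2))              # stand-in for target mask pixels
    rendered = target + np.array([0.3, 0.0])  # same shape, badly initialized
    loss, plan = sinkhorn_matching_loss(rendered, target)
    print("transport cost:", loss)
```

In this toy setting the cost keeps growing as the two point sets move apart, so it still indicates which way to move even when the masks do not overlap at all, whereas a pure pixel-overlap measure would be flat in that regime.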

Related Material


@InProceedings{Zheng_2025_ICCV,
    author    = {Zheng, Weixi and Ling, Jingwang and Wang, Zhibo and Wang, Quan and Xu, Feng},
    title     = {Teeth Reconstruction and Performance Capture Using a Phone Camera},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2025},
    pages     = {9998-10008}
}