GUAVA: Generalizable Upper Body 3D Gaussian Avatar

Zhang, Dongbin; Liu, Yunfei; Lin, Lijian; Zhu, Ye; Li, Yang; Qin, Minghan; Li, Yu; Wang, Haoqian

Dongbin Zhang, Yunfei Liu, Lijian Lin, Ye Zhu, Yang Li, Minghan Qin, Yu Li, Haoqian Wang; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 14205-14217

Abstract

Reconstructing a high-quality, animatable 3D human avatar with expressive facial and hand motions from a single image has gained significant attention due to its broad application potential. 3D human avatar reconstruction typically requires multi-view or monocular videos and training on individual IDs, which is both complex and time-consuming. Furthermore, limited by SMPLX's expressiveness, these methods often focus on body motion but struggle with facial expressions. To address these challenges, we first introduce an expressive human model (EHM) to enhance facial expression capabilities and develop an accurate tracking method. Based on this template model, we propose GUAVA, the first framework for fast animatable upper-body 3D Gaussian avatar reconstruction. We leverage inverse texture mapping and projection sampling techniques to infer Ubody (upper-body) Gaussians from a single image. The rendered images are refined through a neural refiner. Experimental results demonstrate that GUAVA significantly outperforms previous methods in rendering quality and offers significant speed improvements, with reconstruction times in the sub-second range ( 0.1s), and supports real-time animation and rendering.

Related Material

[pdf] [supp] [arXiv]

[bibtex]

@InProceedings{Zhang_2025_ICCV, author = {Zhang, Dongbin and Liu, Yunfei and Lin, Lijian and Zhu, Ye and Li, Yang and Qin, Minghan and Li, Yu and Wang, Haoqian}, title = {GUAVA: Generalizable Upper Body 3D Gaussian Avatar}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2025}, pages = {14205-14217} }