GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation

📅 2023-12-04
📈 Citations: 21
Influential: 3
📄 PDF
🤖 AI Summary
This paper addresses key challenges in high-fidelity 3D head modeling and animation—namely, difficulty in modeling dynamic geometry, complex texture representation, and low rendering efficiency. We propose an anisotropic 3D Gaussian-based avatar representation. Our core contributions are: (1) a learnable Gaussian derivation mechanism that generates multiple “clone” Gaussians via spatial transformation to compactly encode fine-grained head structures; (2) a Gaussian inheritance derivation strategy to accelerate training convergence; and (3) joint optimization of geometry and appearance through integration of a motion deformation field, multi-resolution triplane texture representation, and learnable spatial transformations. Extensive experiments demonstrate state-of-the-art performance across reconstruction accuracy, cross-identity reenactment, and novel-view synthesis—achieving significantly enhanced realism and controllability while maintaining real-time rendering efficiency.
📝 Abstract
Constructing vivid 3D head avatars for given subjects and realizing a series of animations on them is valuable yet challenging. This paper presents GaussianHead, which models the actional human head with anisotropic 3D Gaussians. In our framework, a motion deformation field and multi-resolution tri-plane are constructed respectively to deal with the head's dynamic geometry and complex texture. Notably, we impose an exclusive derivation scheme on each Gaussian, which generates its multiple doppelgangers through a set of learnable parameters for position transformation. With this design, we can compactly and accurately encode the appearance information of Gaussians, even those fitting the head's particular components with sophisticated structures. In addition, an inherited derivation strategy for newly added Gaussians is adopted to facilitate training acceleration. Extensive experiments show that our method can produce high-fidelity renderings, outperforming state-of-the-art approaches in reconstruction, cross-identity reenactment, and novel view synthesis tasks. Our code is available at: https://github.com/chiehwangs/gaussian-head.
Problem

Research questions and friction points this paper is trying to address.

Constructing vivid 3D head avatars for animation
Modeling dynamic geometry and complex texture
Achieving high-fidelity rendering and reconstruction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Models head with anisotropic 3D Gaussians
Uses motion deformation and tri-plane
Learnable parameters for position transformation
🔎 Similar Papers
No similar papers found.
J
Jie Wang
School of Automation and the School of Artificial Intelligence, Nanjing University of Posts and Telecommunications
J
Jiucheng Xie
School of Automation and the School of Artificial Intelligence, Nanjing University of Posts and Telecommunications
X
Xianyan Li
School of Automation and the School of Artificial Intelligence, Nanjing University of Posts and Telecommunications
F
Feng Xu
School of Software and BNRist, Tsinghua University
Chi-Man Pun
Chi-Man Pun
Professor of Computer and Information Science, University of Macau
Image ProcessingPattern RecognitionMultimedia and AI SecurityMedical Image Analysis
H
Hao Gao
School of Automation and the School of Artificial Intelligence, Nanjing University of Posts and Telecommunications