GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor

๐Ÿ“… 2025-01-17
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses critical challenges in text-driven 4D Gaussian avatar editingโ€”namely, severe motion occlusion, spatiotemporal inconsistency, and photometric distortion. Methodologically: (1) We propose a Weighted Alpha Blending Equation (WABE) to dynamically model geometric and visibility changes during motion, effectively mitigating occlusion artifacts; (2) we design a conditional Generative Adversarial Network (cGAN) to jointly optimize photorealism and 4D spatiotemporal consistency; (3) we integrate 3D Gaussian splatting, differentiable rendering, and a dynamic visibility-weighted fusion mechanism. Experiments demonstrate that our framework significantly outperforms existing methods on multi-subject editing tasks. The generated avatars exhibit fine-grained controllability over expressions, poses, and viewpoints; strong inter-frame temporal coherence; multi-view consistency; and high photorealism.

Technology Category

Application Category

๐Ÿ“ Abstract
We introduce GaussianAvatar-Editor, an innovative framework for text-driven editing of animatable Gaussian head avatars that can be fully controlled in expression, pose, and viewpoint. Unlike static 3D Gaussian editing, editing animatable 4D Gaussian avatars presents challenges related to motion occlusion and spatial-temporal inconsistency. To address these issues, we propose the Weighted Alpha Blending Equation (WABE). This function enhances the blending weight of visible Gaussians while suppressing the influence on non-visible Gaussians, effectively handling motion occlusion during editing. Furthermore, to improve editing quality and ensure 4D consistency, we incorporate conditional adversarial learning into the editing process. This strategy helps to refine the edited results and maintain consistency throughout the animation. By integrating these methods, our GaussianAvatar-Editor achieves photorealistic and consistent results in animatable 4D Gaussian editing. We conduct comprehensive experiments across various subjects to validate the effectiveness of our proposed techniques, which demonstrates the superiority of our approach over existing methods. More results and code are available at: [Project Link](https://xiangyueliu.github.io/GaussianAvatar-Editor/).
Problem

Research questions and friction points this paper is trying to address.

4D Gaussian Avatars
Object Occlusion
Temporal-Spatial Mismatch
Innovation

Methods, ideas, or system contributions that make the work stand out.

4D Gaussian Avatars
WABE formula
conditional adversarial learning
๐Ÿ”Ž Similar Papers
No similar papers found.
Xiangyue Liu
Xiangyue Liu
Hong Kong University of Science and Technology (HKUST)
MLLMComputer VisionRobotics
Kunming Luo
Kunming Luo
HKUST
computer vision
H
Heng Li
Hong Kong University of Science and Technology
Q
Qi Zhang
Tencent AI Lab
Y
Yuan Liu
Hong Kong University of Science and Technology
L
Li Yi
Tsinghua University
Ping Tan
Ping Tan
Hong Kong University of Science and Technology (HKUST)
Computer VisionComputer Graphics