Giving Faces Their Feelings Back: Explicit Emotion Control for Feedforward Single-Image 3D Head Avatars

📅 2026-04-15
📈 Citations: 0
Influential: 0
📄 PDF

career value

224K/year
🤖 AI Summary
Existing single-image 3D face reconstruction methods typically embed emotion implicitly within geometry or appearance, making it difficult to achieve consistent emotional control across identities. This work proposes a dual-path modulation mechanism that, without altering existing feed-forward architectures, introduces emotion as an explicit and independent primary control signal into the reconstruction pipeline for the first time. By combining geometric modulation—via emotion-conditioned normalization—and appearance modulation—designed to capture identity-aware emotional visual cues—the approach effectively disentangles emotion from speech-driven facial dynamics and enables cross-identity emotion transfer. Integrated into multiple state-of-the-art backbone networks, the method maintains high-fidelity reconstruction and reenactment capabilities while supporting controllable emotion transfer, smooth interpolation, and disentangled manipulation.

Technology Category

Application Category

📝 Abstract
We present a framework for explicit emotion control in feed-forward, single-image 3D head avatar reconstruction. Unlike existing pipelines where emotion is implicitly entangled with geometry or appearance, we treat emotion as a first-class control signal that can be manipulated independently and consistently across identities. Our method injects emotion into existing feed-forward architectures via a dual-path modulation mechanism without modifying their core design. Geometry modulation performs emotion-conditioned normalization in the original parametric space, disentangling emotional state from speech-driven articulation, while appearance modulation captures identity-aware, emotion-dependent visual cues beyond geometry. To enable learning under this setting, we construct a time-synchronized, emotion-consistent multi-identity dataset by transferring aligned emotional dynamics across identities. Integrated into multiple state-of-the-art backbones, our framework preserves reconstruction and reenactment fidelity while enabling controllable emotion transfer, disentangled manipulation, and smooth emotion interpolation, advancing expressive and scalable 3D head avatars.
Problem

Research questions and friction points this paper is trying to address.

emotion control
3D head avatars
single-image reconstruction
disentanglement
feedforward generation
Innovation

Methods, ideas, or system contributions that make the work stand out.

explicit emotion control
feed-forward 3D avatar
dual-path modulation
emotion-geometry disentanglement
emotion-consistent dataset
🔎 Similar Papers
No similar papers found.