Diff-PC: Identity-preserving and 3D-aware controllable diffusion for zero-shot portrait customization

📅 2024-12-01
🏛️ Information Fusion
📈 Citations: 6
Influential: 0
📄 PDF
🤖 AI Summary
Existing portrait customization methods often struggle to simultaneously preserve identity fidelity and achieve precise facial control. To address this limitation, this work proposes Diff-PC, a novel framework that leverages a 3D face-guided identity encoder and a feature injection mechanism to enable high-fidelity, fine-grained controllable portrait generation under zero-shot conditions. The approach effectively integrates 3D-aware priors with identity features and is compatible with diverse backgrounds and multi-style base models. Trained with a dedicated identity-centric dataset and enhanced by an ID-Encoder, an ID-Ctrl alignment module, and an ID-Injector refinement module, Diff-PC consistently outperforms state-of-the-art methods in terms of identity preservation, facial controllability, and text-to-image alignment.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

portrait customization
identity preservation
facial control
zero-shot generation
3D-aware
Innovation

Methods, ideas, or system contributions that make the work stand out.

3D-aware diffusion
identity preservation
zero-shot portrait customization
facial controllability
ID-Encoder
🔎 Similar Papers
No similar papers found.
Y
Yifang Xu
School of Electronic Science and Engineering, Nanjing University, Nanjing, 210036, China
B
Benxiang Zhai
School of Electronic Science and Engineering, Nanjing University, Nanjing, 210036, China
C
Chenyu Zhang
School of Business, Hohai University, Nanjing, 211100, China
Ming Li
Ming Li
Nanjing University
Machine LearningData Mining
Y
Yang Li
School of Electronic Science and Engineering, Nanjing University, Nanjing, 210036, China
Sidan Du
Sidan Du
Nanjing University
Image Processing and ControlMachine Learning