SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation

📅 2025-07-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address conditional guidance bias in consistency distillation (CD) for text-to-3D generation—arising from imbalance between self-consistency and cross-consistency—this paper proposes Segmented Consistency Trajectory Distillation (SCTD). SCTD partitions the probability flow ordinary differential equation (ODE) trajectory into multiple segments, enforcing consistency constraints independently within each segment; theoretically, this yields a tighter upper bound on distillation error. Integrated with score-based distillation sampling, consistency-distillation optimization, and 3D Gaussian Splatting (3DGS) rendering, SCTD establishes an efficient and stable end-to-end generation pipeline. Experiments demonstrate that our method surpasses current state-of-the-art approaches in visual quality, geometric accuracy, and detail fidelity. It significantly enhances controllability and realism in text-driven 3D content generation.

Technology Category

Application Category

📝 Abstract
Recent advancements in text-to-3D generation improve the visual quality of Score Distillation Sampling (SDS) and its variants by directly connecting Consistency Distillation (CD) to score distillation. However, due to the imbalance between self-consistency and cross-consistency, these CD-based methods inherently suffer from improper conditional guidance, leading to sub-optimal generation results. To address this issue, we present SegmentDreamer, a novel framework designed to fully unleash the potential of consistency models for high-fidelity text-to-3D generation. Specifically, we reformulate SDS through the proposed Segmented Consistency Trajectory Distillation (SCTD), effectively mitigating the imbalance issues by explicitly defining the relationship between self- and cross-consistency. Moreover, SCTD partitions the Probability Flow Ordinary Differential Equation (PF-ODE) trajectory into multiple sub-trajectories and ensures consistency within each segment, which can theoretically provide a significantly tighter upper bound on distillation error. Additionally, we propose a distillation pipeline for a more swift and stable generation. Extensive experiments demonstrate that our SegmentDreamer outperforms state-of-the-art methods in visual quality, enabling high-fidelity 3D asset creation through 3D Gaussian Splatting (3DGS).
Problem

Research questions and friction points this paper is trying to address.

Address imbalance in self- and cross-consistency for 3D generation
Improve fidelity in text-to-3D synthesis via segmented distillation
Enhance visual quality and stability in 3D asset creation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Segmented Consistency Trajectory Distillation for balance
Partitioned PF-ODE trajectory for tighter error bound
Enhanced distillation pipeline for swift stable generation
🔎 Similar Papers
No similar papers found.
Jiahao Zhu
Jiahao Zhu
Sun Yat-sen University
Diffusion modelAI Security
Z
Zixuan Chen
School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
Guangcong Wang
Guangcong Wang
Assistant Professor, Great Bay University
Machine LearningDeep Learning3D VisionAI4Science
X
Xiaohua Xie
School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
Y
Yi Zhou
Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China