🤖 AI Summary
This work addresses the challenge of high-fidelity, training-free 3D shape-and-texture morphing—bypassing conventional reliance on manual correspondence establishment and explicit deformation trajectory estimation. The proposed method formulates morphing as an optimal transport–based barycentric interpolation problem. It integrates a streaming Transformer to generate geometric priors, similarity-guided semantic consistency constraints, and a progressive sequence initialization strategy. Given only source/target text or image prompts, it achieves joint, smooth geometric and textural transitions without registration or supervision, effectively suppressing oversmoothing artifacts. Qualitative and quantitative evaluations across diverse shape-texture co-morphing tasks demonstrate superior performance over existing unsupervised and weakly supervised approaches. To our knowledge, this is the first method enabling purely prompt-driven, end-to-end consistent, high-quality 3D semantic morphing.
📝 Abstract
We present WUKONG, a novel training-free framework for high-fidelity textured 3D morphing that takes a pair of source and target prompts (image or text) as input. Unlike conventional methods -- which rely on manual correspondence matching and deformation trajectory estimation (limiting generalization and requiring costly preprocessing) -- WUKONG leverages the generative prior of flow-based transformers to produce high-fidelity 3D transitions with rich texture details. To ensure smooth shape transitions, we exploit the inherent continuity of flow-based generative processes and formulate morphing as an optimal transport barycenter problem. We further introduce a sequential initialization strategy to prevent abrupt geometric distortions and preserve identity coherence. For faithful texture preservation, we propose a similarity-guided semantic consistency mechanism that selectively retains high-frequency details and enables precise control over blending dynamics. This avoids common artifacts like oversmoothing while maintaining semantic fidelity. Extensive quantitative and qualitative evaluations demonstrate that WUKONG significantly outperforms state-of-the-art methods, achieving superior results across diverse geometry and texture variations.