🤖 AI Summary
Existing multi-view diffusion methods often struggle to generate high-fidelity 3D textures due to view inconsistency, UV mapping distortions, and limitations imposed by point cloud density. To address these challenges, this work proposes TexSpot, a novel framework that introduces Texlet—a spatially uniform latent point representation that decouples texture quality from geometric density. TexSpot integrates a 2D encoder to capture local texture details and a 3D encoder to incorporate global shape context, followed by a cascaded 3D-to-2D decoder for texture reconstruction. A diffusion Transformer, conditioned on Texlets, is then trained to enhance texture fidelity. Experimental results demonstrate that TexSpot significantly outperforms current state-of-the-art methods in terms of visual fidelity, geometric consistency, and robustness.
📝 Abstract
High-quality 3D texture generation remains a fundamental challenge due to the view-inconsistency inherent in current mainstream multi-view diffusion pipelines. Existing representations either rely on UV maps, which suffer from distortion during unwrapping, or point-based methods, which tightly couple texture fidelity to geometric density that limits high-resolution texture generation. To address these limitations, we introduce TexSpot, a diffusion-based texture enhancement framework. At its core is Texlet, a novel 3D texture representation that merges the geometric expressiveness of point-based 3D textures with the compactness of UV-based representation. Each Texlet latent vector encodes a local texture patch via a 2D encoder and is further aggregated using a 3D encoder to incorporate global shape context. A cascaded 3D-to-2D decoder reconstructs high-quality texture patches, enabling the Texlet space learning. Leveraging this representation, we train a diffusion transformer conditioned on Texlets to refine and enhance textures produced by multi-view diffusion methods. Extensive experiments demonstrate that TexSpot significantly improves visual fidelity, geometric consistency, and robustness over existing state-of-the-art 3D texture generation and enhancement approaches. Project page: https://anonymous.4open.science/w/TexSpot-page-2D91.