Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

πŸ“… 2025-01-21
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing 3D generative methods suffer from significant bottlenecks in geometric detail fidelity and texture realism. To address this, we introduce Hunyuan3Dβ€”the first end-to-end high-resolution textured 3D asset generation framework. It comprises two core components: (1) Hunyuan3D-DiT, a geometry generation model based on a scalable streaming diffusion Transformer for precise multimodal conditional alignment; and (2) Hunyuan3D-Paint, a texture synthesis model integrating geometric priors with 3D-aware diffusion priors to enable high-fidelity UV mapping at resolutions β‰₯1024Γ—1024. Complementing these, Hunyuan3D-Studio is an open-source platform supporting interactive editing and animation. Extensive evaluation demonstrates state-of-the-art performance across geometric accuracy, conditional consistency, and texture photorealism. We fully open-source the models, training code, and pre-trained weights, establishing the first large-scale foundational 3D generative model ecosystem.

Technology Category

Application Category

πŸ“ Abstract
We present Hunyuan3D 2.0, an advanced large-scale 3D synthesis system for generating high-resolution textured 3D assets. This system includes two foundation components: a large-scale shape generation model -- Hunyuan3D-DiT, and a large-scale texture synthesis model -- Hunyuan3D-Paint. The shape generative model, built on a scalable flow-based diffusion transformer, aims to create geometry that properly aligns with a given condition image, laying a solid foundation for downstream applications. The texture synthesis model, benefiting from strong geometric and diffusion priors, produces high-resolution and vibrant texture maps for either generated or hand-crafted meshes. Furthermore, we build Hunyuan3D-Studio -- a versatile, user-friendly production platform that simplifies the re-creation process of 3D assets. It allows both professional and amateur users to manipulate or even animate their meshes efficiently. We systematically evaluate our models, showing that Hunyuan3D 2.0 outperforms previous state-of-the-art models, including the open-source models and closed-source models in geometry details, condition alignment, texture quality, and etc. Hunyuan3D 2.0 is publicly released in order to fill the gaps in the open-source 3D community for large-scale foundation generative models. The code and pre-trained weights of our models are available at: https://github.com/Tencent/Hunyuan3D-2
Problem

Research questions and friction points this paper is trying to address.

3D model generation
detail processing
texture handling
Innovation

Methods, ideas, or system contributions that make the work stand out.

High-Detail 3D Modeling
Texture Enhancement
User-Friendly Platform
πŸ”Ž Similar Papers
No similar papers found.
Zibo Zhao
Zibo Zhao
Hunyuan, Tencent; ShanghaiTech
Zeqiang Lai
Zeqiang Lai
CUHK | Tencent | BIT
Low Level VisionGenerated ModelsProximal Algorithm
Q
Qingxiang Lin
Yunfei Zhao
Yunfei Zhao
Peking University
intelligent programcode generationcode representation
H
Haolin Liu
Shuhui Yang
Shuhui Yang
Tencent
Y
Yifei Feng
Mingxin Yang
Mingxin Yang
Peking University
Cyber SecurityLLMEncrypted Traffic Analysis
S
Sheng Zhang
X
Xianghui Yang
H
Huiwen Shi
S
Sicong Liu
J
Junta Wu
Y
Yihang Lian
F
Fan Yang
R
Ruining Tang
Z
Zebin He
Xinzhou Wang
Xinzhou Wang
Tsinghua University, Tongji University
AIGC3D GenerationVideo Generation
J
Jian Liu
X
Xuhui Zuo
Z
Zhuo Chen
Biwen Lei
Biwen Lei
Tencent
Computer VisionDeep Learning
Haohan Weng
Haohan Weng
South China University of Technology
generative modelscomputer vision
J
Jing Xu
Y
Yiling Zhu
Xinhai Liu
Xinhai Liu
Tencent, Tencent.com, China
3D Computer Vision3D Reconstruction3D Generation
L
Lixin Xu
C
Changrong Hu
T
Tianyu Huang
L
Lifu Wang
J
Jihong Zhang
M
Meng Chen
L
Liang Dong
Y
Yiwen Jia
Y
Yulin Cai
J
Jiaao Yu
Y
Yixuan Tang
H
Hao Zhang
Z
Zheng Ye
P
Peng He
R
Runzhou Wu
C
Chao Zhang
Yonghao Tan
Yonghao Tan
The Hong Kong University of Science and Technology
AI AcceleratorComputer VisionVLSI
Jie Xiao
Jie Xiao
University of Science and Technology of China
low level visiongenerative modelmachine learning
Y
Yangyu Tao
J
Jianchen Zhu
J
Jinbao Xue
K
Kai Liu
C
Chongqing Zhao
X
Xinming Wu
Z
Zhichao Hu
L
Lei Qin
J
Jianbing Peng
Zhan Li
Zhan Li
M
Minghui Chen
X
Xipeng Zhang
L
Lin Niu
P
Paige Wang
Y
Yingkai Wang
H
Hao Kuang
Z
Zhongyi Fan
X
Xu Zheng
W
Weihao Zhuang
Y
YingPing He
T
Tian Liu
Y
Yong Yang
D
Di Wang
Yuhong Liu
Yuhong Liu
Santa Clara University
Trustworthy AISecurity and PrivacyIoTBlockchainSocial network
J
Jie Jiang
J
Jingwei Huang
C
Chunchao Guo