Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets

๐Ÿ“… 2025-10-22
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Embodied AI training is hindered by a fundamental trade-off in world simulators between content diversity and physical fidelity: video-based generation lacks real-time physical feedback, while physics engines rely on labor-intensive manual modeling and suffer from poor scalability. This paper introduces the first end-to-end framework that synthesizes simulation-ready 3D assets from a single input imageโ€”yielding high-fidelity geometry, texture-geometry alignment, and physically consistent material parameters directly compatible with mainstream physics engines (e.g., PyBullet, MuJoCo). Our approach unifies foundation models, neural 3D reconstruction, inverse rendering, and physics-aware material parameter estimation. Experiments demonstrate that the generated assets enable photorealistic robotic manipulation simulation and large-scale scene construction, significantly improving both content generation efficiency and diversity without compromising physical plausibility. The code and models are publicly released.

Technology Category

Application Category

๐Ÿ“ Abstract
Developing embodied AI agents requires scalable training environments that balance content diversity with physics accuracy. World simulators provide such environments but face distinct limitations: video-based methods generate diverse content but lack real-time physics feedback for interactive learning, while physics-based engines provide accurate dynamics but face scalability limitations from costly manual asset creation. We present Seed3D 1.0, a foundation model that generates simulation-ready 3D assets from single images, addressing the scalability challenge while maintaining physics rigor. Unlike existing 3D generation models, our system produces assets with accurate geometry, well-aligned textures, and realistic physically-based materials. These assets can be directly integrated into physics engines with minimal configuration, enabling deployment in robotic manipulation and simulation training. Beyond individual objects, the system scales to complete scene generation through assembling objects into coherent environments. By enabling scalable simulation-ready content creation, Seed3D 1.0 provides a foundation for advancing physics-based world simulators. Seed3D 1.0 is now available on https://console.volcengine.com/ark/region:ark+cn-beijing/experience/vision?modelId=doubao-seed3d-1-0-250928&tab=Gen3D
Problem

Research questions and friction points this paper is trying to address.

Generates simulation-ready 3D assets from single images
Addresses scalability limitations of manual physics asset creation
Produces assets with accurate geometry and physics-compatible materials
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generates simulation-ready 3D assets from single images
Produces assets with accurate geometry and aligned textures
Enables direct integration into physics engines with minimal configuration
๐Ÿ”Ž Similar Papers
No similar papers found.
Jiashi Feng
Jiashi Feng
ByteDance Inc.
computer visionmachine learning
Xiu Li
Xiu Li
Bytedance Seed
Computer VisionComputer Graphics3D Vision
J
Jing Lin
ByteDance Seed
J
Jiahang Liu
ByteDance Seed
G
Gaohong Liu
ByteDance Seed
W
Weiqiang Lou
ByteDance Seed
S
Su Ma
ByteDance Seed
G
Guang Shi
ByteDance Seed
Q
Qinlong Wang
ByteDance Seed
J
Jun Wang
ByteDance Seed
Zhongcong Xu
Zhongcong Xu
National University of Singapore
Computer vision
Xuanyu Yi
Xuanyu Yi
ByteDance Seed
3D VisionGenerative Model
Zihao Yu
Zihao Yu
University of Science and Technology of China
J
Jianfeng Zhang
ByteDance Seed
Yifan Zhu
Yifan Zhu
Beijing University of Posts and Telecommunications
PEFT of LLMsGraph RAGGraph mining
R
Rui Chen
ByteDance Seed
J
Jinxin Chi
ByteDance Seed
Z
Zixian Du
ByteDance Seed
L
Li Han
ByteDance Seed
L
Lixin Huang
ByteDance Seed
K
Kaihua Jiang
ByteDance Seed
Y
Yuhan Li
ByteDance Seed
Guan Luo
Guan Luo
Tsinghua University
3D generation
S
Shuguang Wang
ByteDance Seed
Qianyi Wu
Qianyi Wu
Monash University
Computer VisionComputer Graphics3D Vision
F
Fan Yang
ByteDance Seed
Junyang Zhang
Junyang Zhang
California Institute of Technology, Stanford University, University of California, Irvine
machine learning and ML systemroboticsdigital designsemiconductorintegrated circuits
Xuanmeng Zhang
Xuanmeng Zhang
University of Technology Sydney
computer vision