Cube: A Roblox View of 3D Intelligence

📅 2025-03-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the lack of foundational models for 3D intelligence in the Roblox ecosystem. To tackle the challenge of modeling complex 3D geometric structures, we propose the first geometry-aware 3D shape tokenizer, enabling cross-modal semantic alignment among text, shape, and scene representations. We establish three design principles for 3D foundation models and introduce an LLM-3D collaborative reasoning framework alongside a unified text-to-shape/scene joint generation architecture. Our approach unifies support for 3D object/scene generation, character rigging, and behavioral script synthesis. Experiments demonstrate significant improvements in fidelity across text-to-shape, shape-to-text, and text-to-scene generation tasks. Moreover, the model supports cross-modal understanding and logical reasoning over 3D content. By providing a scalable, general-purpose intelligent foundation, this work advances programmable 3D content creation within Roblox and beyond.

Technology Category

Application Category

📝 Abstract
Foundation models trained on vast amounts of data have demonstrated remarkable reasoning and generation capabilities in the domains of text, images, audio and video. Our goal at Roblox is to build such a foundation model for 3D intelligence, a model that can support developers in producing all aspects of a Roblox experience, from generating 3D objects and scenes to rigging characters for animation to producing programmatic scripts describing object behaviors. We discuss three key design requirements for such a 3D foundation model and then present our first step towards building such a model. We expect that 3D geometric shapes will be a core data type and describe our solution for 3D shape tokenizer. We show how our tokenization scheme can be used in applications for text-to-shape generation, shape-to-text generation and text-to-scene generation. We demonstrate how these applications can collaborate with existing large language models (LLMs) to perform scene analysis and reasoning. We conclude with a discussion outlining our path to building a fully unified foundation model for 3D intelligence.
Problem

Research questions and friction points this paper is trying to address.

Develop a foundation model for 3D intelligence in Roblox.
Support developers in generating 3D objects, scenes, and animations.
Integrate 3D shape tokenization with text-to-shape and scene generation.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Develops 3D foundation model for Roblox
Introduces 3D shape tokenizer solution
Integrates with LLMs for scene analysis
🔎 Similar Papers
No similar papers found.
F
Foundation AI Team Roblox Kiran Bhat
Foundation AI team, Roblox
N
Nishchaie Khanna
Foundation AI team, Roblox
K
Karun Channa
Foundation AI team, Roblox
Tinghui Zhou
Tinghui Zhou
Roblox, Foundation AI
Computer Vision/GraphicsMachine Learning
Yiheng Zhu
Yiheng Zhu
Zhongguancun Academy & Zhongguancun Institute of Artificial Intelligence
AI for ScienceDeep generative modelsProtein designDrug discovery
X
Xiaoxia Sun
Foundation AI team, Roblox
C
Charles Shang
Foundation AI team, Roblox
A
Anirudh Sudarshan
Foundation AI team, Roblox
M
Maurice Chu
Foundation AI team, Roblox
Daiqing Li
Daiqing Li
University of Toronto, Roblox, ex Playground, NVIDIA Research
computer visioncomputer graphics
K
Kangle Deng
Foundation AI team, Roblox
J
Jean-Philippe Fauconnier
Foundation AI team, Roblox
T
Tijmen Verhulsdonck
Foundation AI team, Roblox
Maneesh Agrawala
Maneesh Agrawala
Stanford University
GraphicsComputer GraphicsHCIVisualization
Kayvon Fatahalian
Kayvon Fatahalian
Associate Professor of Computer Science, Stanford University
Computer GraphicsSystems
Alexander Weiss
Alexander Weiss
Brown University
Computer Vision
C
Christian Reiser
Foundation AI team, Roblox
R
Ravi Kiran Chirravuri
Foundation AI team, Roblox
R
Ravali Kandur
Foundation AI team, Roblox
A
Alejandro Pelaez
Foundation AI team, Roblox
A
Akash Garg
Foundation AI team, Roblox
M
Michael Palleschi
Foundation AI team, Roblox
Jessica Wang
Jessica Wang
Professor of U.S. History, University of British Columbia
U.S. political historystate powerhistory of sciencehistory of medicine
S
Skylar Litz
Foundation AI team, Roblox
L
Leon Liu
Foundation AI team, Roblox
A
Anying Li
Foundation AI team, Roblox
D
David Harmon
Foundation AI team, Roblox
D
Derek Liu
Foundation AI team, Roblox
L
Liangjun Feng
Foundation AI team, Roblox
D
Denis Goupil
Foundation AI team, Roblox
L
Lukas Kuczynski
Foundation AI team, Roblox
J
Jihyun Yoon
Foundation AI team, Roblox
N
Naveen Marri
Foundation AI team, Roblox
P
Peiye Zhuang
Foundation AI team, Roblox
Yinan Zhang
Yinan Zhang
Zhejiang University
Digital twinsChemical Engineering Modeling
B
Brian Yin
Foundation AI team, Roblox
H
Haomiao Jiang
Foundation AI team, Roblox
M
Marcel van Workum
Foundation AI team, Roblox
T
Thomas Lane
Foundation AI team, Roblox
B
Bryce Erickson
Foundation AI team, Roblox
S
Salil Pathare
Foundation AI team, Roblox
K
Kyle Price
Foundation AI team, Roblox
A
Anupam Singh
Foundation AI team, Roblox
D
David Baszucki
Foundation AI team, Roblox