CHOrD: Generation of Collision-Free, House-Scale, and Organized Digital Twins for 3D Indoor Scenes with Controllable Floor Plans and Optimal Layouts

📅 2025-03-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses key challenges in large-scale indoor digital twin construction—namely, object collisions, structural incoherence, and poor floorplan adaptability during 3D scene generation. To this end, we propose an end-to-end generative framework leveraging 2D layout images as an intermediate representation. Methodologically, we introduce the first approach that exploits 2D layout encoding to detect out-of-distribution (OOD) anomalies, thereby fundamentally mitigating collisions; integrate conditional diffusion models with scene graph decoders to produce geometrically collision-free, semantically structured, and whole-apartment-coherent layouts; and support both floorplan-driven and multimodal (text/sketch) joint control. Our contributions include: (1) the first OOD-aware, collision-robust generative mechanism; (2) a high-quality, large-coverage indoor layout dataset; and (3) state-of-the-art performance on 3D-FRONT and our proprietary dataset, yielding high-fidelity, spatially consistent, and editable apartment-level 3D scenes.

Technology Category

Application Category

📝 Abstract
We introduce CHOrD, a novel framework for scalable synthesis of 3D indoor scenes, designed to create house-scale, collision-free, and hierarchically structured indoor digital twins. In contrast to existing methods that directly synthesize the scene layout as a scene graph or object list, CHOrD incorporates a 2D image-based intermediate layout representation, enabling effective prevention of collision artifacts by successfully capturing them as out-of-distribution (OOD) scenarios during generation. Furthermore, unlike existing methods, CHOrD is capable of generating scene layouts that adhere to complex floor plans with multi-modal controls, enabling the creation of coherent, house-wide layouts robust to both geometric and semantic variations in room structures. Additionally, we propose a novel dataset with expanded coverage of household items and room configurations, as well as significantly improved data quality. CHOrD demonstrates state-of-the-art performance on both the 3D-FRONT and our proposed datasets, delivering photorealistic, spatially coherent indoor scene synthesis adaptable to arbitrary floor plan variations.
Problem

Research questions and friction points this paper is trying to address.

Generates collision-free, house-scale 3D indoor digital twins.
Incorporates 2D image-based layout to prevent collision artifacts.
Creates coherent layouts adaptable to complex floor plan variations.
Innovation

Methods, ideas, or system contributions that make the work stand out.

2D image-based intermediate layout representation
Multi-modal controls for complex floor plans
Novel dataset with expanded household item coverage
🔎 Similar Papers
No similar papers found.
C
Chong Su
KE Holdings Inc.
Y
Yingbin Fu
KE Holdings Inc.
Z
Zheyuan Hu
Department of Computer Science and Technology, University of Cambridge
J
Jing Yang
Department of Computer Science and Technology, University of Cambridge
Param Hanji
Param Hanji
University of Cambridge
computer visionmachine learningcomputational photography
Shaojun Wang
Shaojun Wang
Soochow University, TU/e, University of Strasbourg
NanophotonicsLight-matter interactionsNanofabrication
Xuan Zhao
Xuan Zhao
PhD, Forschungszentrum Jülich GmbH
XAIFair AI
C
Cengiz Oztireli
Department of Computer Science and Technology, University of Cambridge
F
Fangcheng Zhong
Department of Computer Science and Technology, University of Cambridge