HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model

๐Ÿ“… 2024-06-28
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 1
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the problem of generating high-fidelity, structurally coherent large-scale 3D indoor scenes (e.g., whole houses) from 2D floor plans. Methodologically, it pioneers the adaptation of web-scale pre-trained 2D diffusion models to 3D scene generation, introducing a floor-plan-conditioned encoding scheme and an attention-driven consistency modeling mechanism. It autoregressively synthesizes multi-view-consistent RGB-D images while incorporating global floor-plan guidance to ensure spatial coherence. The key contribution lies in leveraging 2D diffusion priors for joint cross-view geometric and appearance modelingโ€”enabling high-fidelity texture synthesis and plausible structural layout without any 3D supervision. Experiments on the 3D-Front dataset demonstrate that the generated house-level scenes significantly outperform existing methods in structural plausibility, visual realism, and multi-view consistency. Ablation studies validate the efficacy of each proposed component.

Technology Category

Application Category

๐Ÿ“ Abstract
We introduce HouseCrafter, a novel approach that can lift a floorplan into a complete large 3D indoor scene (e.g., a house). Our key insight is to adapt a 2D diffusion model, which is trained on web-scale images, to generate consistent multi-view color (RGB) and depth (D) images across different locations of the scene. Specifically, the RGB-D images are generated autoregressively in a batch-wise manner along sampled locations based on the floorplan, where previously generated images are used as condition to the diffusion model to produce images at nearby locations. The global floorplan and attention design in the diffusion model ensures the consistency of the generated images, from which a 3D scene can be reconstructed. Through extensive evaluation on the 3D-Front dataset, we demonstrate that HouseCraft can generate high-quality house-scale 3D scenes. Ablation studies also validate the effectiveness of different design choices. We will release our code and model weights. Project page: https://neu-vi.github.io/houseCrafter/
Problem

Research questions and friction points this paper is trying to address.

Lifting 2D floorplans to 3D indoor scenes
Generating consistent multi-view RGB-D images
Reconstructing 3D scenes from diffusion-generated imagery
Innovation

Methods, ideas, or system contributions that make the work stand out.

Adapts 2D diffusion model for 3D scene generation
Uses autoregressive RGB-D image generation with conditioning
Ensures consistency through floorplan-guided attention design
๐Ÿ”Ž Similar Papers
No similar papers found.