HSM: Hierarchical Scene Motifs for Multi-Scale Indoor Scene Generation

📅 2025-03-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing indoor 3D scene layout generation methods focus predominantly on large furniture, neglecting small objects—leading to sparse, geometrically distorted scenes that fail to satisfy dense spatial arrangements specified in text descriptions. To address this, we propose the Hierarchical Scene Primitive (HSP) framework, the first to explicitly model cross-scale spatial dependencies between surfaces and objects, as well as recurring layout patterns across scales, enabling coherent multi-scale generation—from floor-level furniture to tabletop small objects. Our approach integrates a hierarchical graph neural network with a conditional diffusion model to jointly encode surface semantics, geometric constraints, and multi-granularity layout priors. Evaluated across diverse room types and layout configurations, our method achieves significant improvements in visual plausibility and text-layout alignment over state-of-the-art approaches.

Technology Category

Application Category

📝 Abstract
Despite advances in indoor 3D scene layout generation, synthesizing scenes with dense object arrangements remains challenging. Existing methods primarily focus on large furniture while neglecting smaller objects, resulting in unrealistically empty scenes. Those that place small objects typically do not honor arrangement specifications, resulting in largely random placement not following the text description. We present HSM, a hierarchical framework for indoor scene generation with dense object arrangements across spatial scales. Indoor scenes are inherently hierarchical, with surfaces supporting objects at different scales, from large furniture on floors to smaller objects on tables and shelves. HSM embraces this hierarchy and exploits recurring cross-scale spatial patterns to generate complex and realistic indoor scenes in a unified manner. Our experiments show that HSM outperforms existing methods by generating scenes that are more realistic and better conform to user input across room types and spatial configurations.
Problem

Research questions and friction points this paper is trying to address.

Generating dense indoor 3D scenes with realistic object arrangements
Addressing neglect of small objects in current scene generation methods
Ensuring object placement follows hierarchical and textual specifications
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical framework for multi-scale scene generation
Exploits recurring cross-scale spatial patterns
Unified generation of complex realistic indoor scenes
🔎 Similar Papers
2024-09-12arXiv.orgCitations: 9
Hou In Derek Pun
Hou In Derek Pun
Simon Fraser University
computer graphicsscene generation
Hou In Ivan Tam
Hou In Ivan Tam
Simon Fraser University
A
Austin T. Wang
Simon Fraser University
X
Xiaoliang Huo
Simon Fraser University
A
Angel X. Chang
Simon Fraser University, Alberta Machine Intelligence Institute (Amii)
M
M. Savva
Simon Fraser University