INHerit-SG: Incremental Hierarchical Semantic Scene Graphs with RAG-Style Retrieval

📅 2026-02-13
📈 Citations: 0
Influential: 0
📄 PDF

Technology Category

Application Category

📝 Abstract
Driven by advancements in foundation models, semantic scene graphs have emerged as a prominent paradigm for high-level 3D environmental abstraction in robot navigation. However, existing approaches are fundamentally misaligned with the needs of embodied tasks. As they rely on either offline batch processing or implicit feature embeddings, the maps can hardly support interpretable human-intent reasoning in complex environments. To address these limitations, we present INHerit-SG. We redefine the map as a structured, RAG-ready knowledge base where natural-language descriptions are introduced as explicit semantic anchors to better align with human intent. An asynchronous dual-process architecture, together with a Floor-Room-Area-Object hierarchy, decouples geometric segmentation from time-consuming semantic reasoning. An event-triggered map update mechanism reorganizes the graph only when meaningful semantic events occur. This strategy enables our graph to maintain long-term consistency with relatively low computational overhead. For retrieval, we deploy multi-role Large Language Models (LLMs) to decompose queries into atomic constraints and handle logical negations, and employ a hard-to-soft filtering strategy to ensure robust reasoning. This explicit interpretability improves the success rate and reliability of complex retrievals, enabling the system to adapt to a broader spectrum of human interaction tasks. We evaluate INHerit-SG on a newly constructed dataset, HM3DSem-SQR, and in real-world environments. Experiments demonstrate that our system achieves state-of-the-art performance on complex queries, and reveal its scalability for downstream navigation tasks. Project Page: https://fangyuktung.github.io/INHeritSG.github.io/
Problem

Research questions and friction points this paper is trying to address.

semantic scene graphs
human-intent reasoning
embodied tasks
interpretable AI
3D environmental abstraction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semantic Scene Graph
RAG-style Retrieval
Incremental Hierarchical Mapping
Event-triggered Update
Multi-role LLMs
🔎 Similar Papers
No similar papers found.
Y
YukTungSamuel Fang
State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
Z
Zhikang Shi
State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
J
Jiabin Qiu
State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
Zixuan Chen
Zixuan Chen
Nanjing University
embodied AIimitation learningreinforcement learning.
J
Jieqi Shi
State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
Hao Xu
Hao Xu
Beijing University of Posts and Telecommunications
Communication Theoryintelligent reflecting surfacespectral efficiency
Jing Huo
Jing Huo
Nanjing University
Machine LearningComputer Vision
Yang Gao
Yang Gao
Nanjing University, China
Artificial IntelligenceMachine LearningMulti-agent SystemsBig DataImage/Video Process