ReLaGS: Relational Language Gaussian Splatting

📅 2026-03-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing 3D perception methods often rely on object-centric modeling or extensive scene-specific training, hindering unified and efficient open-vocabulary reasoning. This work proposes a training-free, unified framework that constructs a hierarchical 3D scene representation by distilling language-aligned Gaussian splats, refines geometry through Gaussian pruning, and aggregates multi-view 2D features via language-guided alignment to produce precise 3D object embeddings. Building upon this representation, the method constructs an open-vocabulary 3D semantic scene graph that jointly models hierarchical semantics and intra- and inter-object relationships, enabling unified reasoning across segmentation, retrieval, and relational understanding. Experiments demonstrate that the approach is both efficient and scalable across multiple tasks.

Technology Category

Application Category

📝 Abstract
Achieving unified 3D perception and reasoning across tasks such as segmentation, retrieval, and relation understanding remains challenging, as existing methods are either object-centric or rely on costly training for inter-object reasoning. We present a novel framework that constructs a hierarchical language-distilled Gaussian scene and its 3D semantic scene graph without scene-specific training. A Gaussian pruning mechanism refines scene geometry, while a robust multi-view language alignment strategy aggregates noisy 2D features into accurate 3D object embeddings. On top of this hierarchy, we build an open-vocabulary 3D scene graph with Vision Language derived annotations and Graph Neural Network-based relational reasoning. Our approach enables efficient and scalable open-vocabulary 3D reasoning by jointly modeling hierarchical semantics and inter/intra-object relationships, validated across tasks including open-vocabulary segmentation, scene graph generation, and relation-guided retrieval. Project page: https://dfki-av.github.io/ReLaGS/
Problem

Research questions and friction points this paper is trying to address.

3D perception
relational reasoning
open-vocabulary
scene graph
unified reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Gaussian Splatting
Open-vocabulary 3D Reasoning
Scene Graph Generation
Multi-view Language Alignment
Hierarchical Semantic Modeling
🔎 Similar Papers
No similar papers found.
Y
Yaxu Xie
German Research Center for Artificial Intelligence (DFKI); RPTU University Kaiserslautern-Landau
A
Abdalla Arafa
German Research Center for Artificial Intelligence (DFKI); RPTU University Kaiserslautern-Landau
Alireza Javanmardi
Alireza Javanmardi
LMU Munich, MCML
Machine LearningConformal PredictionMulti-armed Bandits
C
Christen Millerdurai
German Research Center for Artificial Intelligence (DFKI)
Jia Cheng Hu
Jia Cheng Hu
Unimore
Deep LearningGPU
S
Shaoxiang Wang
German Research Center for Artificial Intelligence (DFKI); RPTU University Kaiserslautern-Landau
Alain Pagani
Alain Pagani
Principal Researcher, German Research Center for Artificial Intelligence (DFKI)
Computer VisionAugmented RealityArtificial IntelligenceImage Understanding
Didier Stricker
Didier Stricker
Professor for Computer Science, University Kaiserslautern
augmented realitycomputer visionimage processingbody sensor networkshci