Edge-Centric Relational Reasoning for 3D Scene Graph Prediction

📅 2025-11-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing 3D scene graph prediction methods predominantly adopt object-centric graph neural networks (GNNs), which struggle to capture high-order relational dependencies among entities. Method: This paper proposes a relation-centric inference paradigm: it transforms the original object-centric graph into a line graph—where relations become nodes and higher-order associations become edges—and designs a novel edge-centric line graph neural network. A link-guided mechanism is introduced to suppress noisy relations, enabling progressive fusion of relation-level contextual information into object-level understanding. The framework is modular and compatible with arbitrary baseline methods. Key components include line graph construction, object-aware feature fusion, link prediction, and multi-granularity message passing. Results: On the 3DSSG benchmark, our method consistently outperforms two strong baselines across all metrics, validating both the effectiveness and generalizability of the relation-to-object reasoning paradigm.

Technology Category

Application Category

📝 Abstract
3D scene graph prediction aims to abstract complex 3D environments into structured graphs consisting of objects and their pairwise relationships. Existing approaches typically adopt object-centric graph neural networks, where relation edge features are iteratively updated by aggregating messages from connected object nodes. However, this design inherently restricts relation representations to pairwise object context, making it difficult to capture high-order relational dependencies that are essential for accurate relation prediction. To address this limitation, we propose a Link-guided Edge-centric relational reasoning framework with Object-aware fusion, namely LEO, which enables progressive reasoning from relation-level context to object-level understanding. Specifically, LEO first predicts potential links between object pairs to suppress irrelevant edges, and then transforms the original scene graph into a line graph where each relation is treated as a node. A line graph neural network is applied to perform edge-centric relational reasoning to capture inter-relation context. The enriched relation features are subsequently integrated into the original object-centric graph to enhance object-level reasoning and improve relation prediction. Our framework is model-agnostic and can be integrated with any existing object-centric method. Experiments on the 3DSSG dataset with two competitive baselines show consistent improvements, highlighting the effectiveness of our edge-to-object reasoning paradigm.
Problem

Research questions and friction points this paper is trying to address.

Existing methods restrict relation representations to pairwise object context
Current approaches struggle to capture high-order relational dependencies
Object-centric graph networks limit accurate 3D scene graph prediction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Link-guided Edge-centric relational reasoning framework
Transforms scene graph into line graph representation
Integrates enriched relation features into object-centric graph
🔎 Similar Papers
No similar papers found.
Y
Yanni Ma
School of Electronics and Communication Engineering, Sun Yat-sen University, Shenzhen, China
H
Hao Liu
Key Lab of Spatial-temporal Big Data Analysis and Application of Natural Resources in Megacities, Ministry of Natural Resources, East China Normal University (ECNU), Shanghai, China
Yulan Guo
Yulan Guo
Professor, Sun Yat-sen University
3D VisionMachine LearningRobotics
Theo Gevers
Theo Gevers
Computer Vision Research Group, University of Amsterdam (UvA); 3DUniversum
computer visionimage understandingdeep learningobject recognitioncolor
Martin R. Oswald
Martin R. Oswald
University of Amsterdam
3D Computer VisionRepresentation LearningApplied Machine LearningOptimization