🤖 AI Summary
Problem: In robotics, heterogeneous scene description formats such as MJCF, URDF, and SDF are mutually incompatible, severely impeding unified environmental knowledge modeling and semantic reasoning.
Method: This paper proposes a Unified Scene Graph Model grounded in Universal Scene Description (USD), enabling standardized, semantics-preserving fusion of multi-source formats into a coherent USD representation. It introduces a robot-ontology-aligned semantic mapping and annotation framework to construct an executable, task-oriented knowledge base, integrated with ontology-based reasoning and web-based visualization tools for semantic querying and environment management.
Results: Experiments demonstrate automatic conversion of procedural 3D environments into semantically enriched USD scenes, generation of structured knowledge graphs, and real-time, interpretable robotic decision-making evaluated through competency questions. The approach improves cross-format environmental understanding and supports cognitive reasoning over the resulting knowledge graph.
📝 Abstract
In robotics, the effective integration of environmental data into actionable knowledge remains a significant challenge due to the variety and incompatibility of data formats commonly used in scene descriptions, such as MJCF, URDF, and SDF. This paper presents a novel approach that addresses these challenges by developing a unified scene graph model that standardizes these varied formats into the Universal Scene Description (USD) format. This standardization facilitates the integration of these scene graphs with robot ontologies through semantic reporting, enabling the translation of complex environmental data into actionable knowledge essential for cognitive robotic control. We evaluated our approach by converting procedural 3D environments into USD format; the resulting scenes are then annotated semantically and translated into a knowledge graph that answers competency questions, demonstrating the approach's utility for real-time robotic decision-making. Additionally, we developed a web-based visualization tool to support the semantic mapping process, providing users with an intuitive interface to manage the 3D environment.
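The pipeline described in the abstract (source format, unified scene graph, semantic annotation, knowledge graph, competency question) can be illustrated with a small sketch. This is not the paper's implementation and does not use the USD API: it assumes a tiny hand-written URDF snippet, represents the scene graph as a plain dictionary keyed by USD-style prim paths, and uses hypothetical class and predicate names (`Refrigerator`, `Door`, `hasJointType`) in place of a real robot ontology.

```python
import xml.etree.ElementTree as ET

# Hypothetical URDF input; link and joint names are made up for illustration.
URDF = """
<robot name="kitchen">
  <link name="world"/>
  <link name="fridge"/>
  <link name="fridge_door"/>
  <joint name="world_to_fridge" type="fixed">
    <parent link="world"/>
    <child link="fridge"/>
  </joint>
  <joint name="fridge_hinge" type="revolute">
    <parent link="fridge"/>
    <child link="fridge_door"/>
  </joint>
</robot>
"""


def urdf_to_scene_graph(urdf_text):
    """Flatten URDF links and joints into a dict keyed by USD-style paths."""
    root = ET.fromstring(urdf_text.strip())
    parent_of, joint_of = {}, {}
    for joint in root.findall("joint"):
        child = joint.find("child").get("link")
        parent_of[child] = joint.find("parent").get("link")
        joint_of[child] = joint.get("type")

    def path(link):
        # Walk up the parent chain to build a hierarchical path.
        parts = [link]
        while parts[-1] in parent_of:
            parts.append(parent_of[parts[-1]])
        return "/" + "/".join(reversed(parts))

    return {
        path(link.get("name")): {"joint": joint_of.get(link.get("name"))}
        for link in root.findall("link")
    }


def to_triples(graph, annotations):
    """Turn the scene graph plus semantic annotations into (s, p, o) triples."""
    triples = []
    for prim_path, attrs in graph.items():
        if attrs["joint"] is not None:
            triples.append((prim_path, "hasJointType", attrs["joint"]))
        if prim_path in annotations:
            triples.append((prim_path, "rdf:type", annotations[prim_path]))
    return triples


def doors_opened_by_rotation(triples):
    """Competency question: which doors are attached by a revolute joint?"""
    facts = set(triples)
    return sorted(
        s for (s, p, o) in facts
        if p == "rdf:type" and o == "Door"
        and (s, "hasJointType", "revolute") in facts
    )


graph = urdf_to_scene_graph(URDF)
# Assumed ontology classes; a real system would reference ontology IRIs.
annotations = {
    "/world/fridge": "Refrigerator",
    "/world/fridge/fridge_door": "Door",
}
triples = to_triples(graph, annotations)
answer = doors_opened_by_rotation(triples)
print(answer)  # ['/world/fridge/fridge_door']
```

In a real system the dictionary would presumably be replaced by USD prims and the triple list by an RDF store queried through an ontology reasoner; the sketch only shows how annotated scene structure can answer a question that neither the geometry nor the labels could answer alone.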