🤖 AI Summary
Existing reinforcement learning–based autonomous exploration methods exhibit weak reasoning over graph-structured environments and give insufficient consideration to robot motion, yielding policies that minimize travel distance at the expense of time efficiency. This work proposes GRATE, a graph transformer–enhanced deep reinforcement learning framework: a graph transformer models the environmental informative graph to capture both local structural patterns and long-range global dependencies, and a Kalman filter smooths the policy's waypoint outputs so that the resulting path is kinodynamically feasible for the robot to follow. The framework thereby improves exploration coverage, traversal distance, and time cost together. Experiments across multiple simulated environments demonstrate reductions of up to 21.5% in exploration distance and 21.3% in exploration time relative to state-of-the-art conventional and learning-based baselines, and deployment on a physical robot platform further validates the planner's robustness and real-time performance.
📝 Abstract
Autonomous robot exploration (ARE) is the process by which a robot autonomously navigates and maps an unknown environment. Recent Reinforcement Learning (RL)-based approaches typically formulate ARE as a sequential decision-making problem defined on a collision-free informative graph. However, these methods often demonstrate limited reasoning ability over graph-structured data. Moreover, due to insufficient consideration of robot motion, the resulting RL policies are generally optimized to minimize travel distance while neglecting time efficiency. To overcome these limitations, we propose GRATE, a Deep Reinforcement Learning (DRL)-based approach that leverages a Graph Transformer to effectively capture both local structural patterns and global contextual dependencies of the informative graph, thereby enhancing the model's reasoning capability across the entire environment. In addition, we deploy a Kalman filter to smooth the waypoint outputs, ensuring that the resulting path is kinodynamically feasible for the robot to follow. Experimental results demonstrate that our method achieves better exploration efficiency than state-of-the-art conventional and learning-based baselines across various simulation benchmarks, reducing the distance to complete exploration by up to 21.5% and the time by up to 21.3%. We also validate our planner in real-world scenarios.
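The abstract describes the Graph Transformer as capturing both local structural patterns and global contextual dependencies of the informative graph. The NumPy sketch below illustrates one way such a layer can work, not GRATE's actual architecture (which is not specified here): full pairwise attention provides global context, while a hypothetical additive edge bias (`edge_bias` applied through the adjacency matrix `A`) steers attention toward graph neighbors to encode local structure.

```python
# Minimal sketch of one attention layer over an informative graph.
# Assumptions (not from the paper): node features X of shape (N, d),
# a dense adjacency matrix A of shape (N, N), and an additive edge bias.
import numpy as np

def graph_attention(X, A, Wq, Wk, Wv, edge_bias=1.0):
    """Mix global context (full attention) with local structure (edge bias)."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)                 # global pairwise scores
    scores = scores + edge_bias * A               # bias attention toward edges
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=-1, keepdims=True)      # row-wise softmax
    return attn @ V                               # aggregated node features

# Tiny usage example: random features on a 4-node path graph.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
Wq, Wk, Wv = (rng.normal(size=(8, 8)) * 0.1 for _ in range(3))
out = graph_attention(X, A, Wq, Wk, Wv)           # (4, 8) updated features
```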
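The Kalman-filter waypoint smoothing can be pictured as a standard predict/update loop over the policy's raw waypoints. The sketch below is an assumption-laden illustration, not the paper's implementation: it assumes 2D waypoints, a constant-velocity motion model, and isotropic noise, with `process_var` and `meas_var` as illustrative parameters.

```python
# Minimal sketch of waypoint smoothing with a constant-velocity Kalman
# filter. The state layout [x, y, vx, vy] and the noise models are
# assumptions; the paper does not specify them in this summary.
import numpy as np

def smooth_waypoints(waypoints, dt=1.0, process_var=1e-2, meas_var=1e-1):
    """Filter an (N, 2) array of raw waypoints into a smoother path."""
    F = np.array([[1, 0, dt, 0],      # state transition: constant velocity
                  [0, 1, 0, dt],
                  [0, 0, 1,  0],
                  [0, 0, 0,  1]])
    H = np.array([[1, 0, 0, 0],       # only positions are observed
                  [0, 1, 0, 0]])
    Q = process_var * np.eye(4)       # process noise (assumed isotropic)
    R = meas_var * np.eye(2)          # measurement noise on raw waypoints
    x = np.array([*waypoints[0], 0.0, 0.0])  # initial state, zero velocity
    P = np.eye(4)
    smoothed = [np.asarray(waypoints[0], dtype=float)]
    for z in waypoints[1:]:
        # Predict step: propagate state and covariance through the model.
        x = F @ x
        P = F @ P @ F.T + Q
        # Update step: correct with the next raw waypoint as a measurement.
        S = H @ P @ H.T + R
        K = P @ H.T @ np.linalg.inv(S)
        x = x + K @ (z - H @ x)
        P = (np.eye(4) - K @ H) @ P
        smoothed.append(x[:2])
    return np.asarray(smoothed)
```

A smaller `meas_var` keeps the filtered path close to the raw waypoints, while a smaller `process_var` trusts the constant-velocity model more and smooths harder; that trade-off is what makes the output easier for the robot to track.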