Cooperative Hybrid Multi-Agent Pathfinding Based on Shared Exploration Maps

📅 2025-03-28

📈 Citations: 0

✨ Influential: 0

career value

224K/year

🤖 AI Summary

To address the challenge of balancing global optimality and local responsiveness in multi-agent pathfinding under dynamic, partially observable environments, this paper proposes a hybrid framework integrating D* Lite with a multi-agent reinforcement learning variant (MAPPO). The method introduces three key innovations: (1) a shared exploration map representation for coordinated environmental awareness; (2) an adaptive online policy switching mechanism enabling context-aware transitions between global re-planning and local decision-making; and (3) a freezing prevention strategy to ensure continuous agent progress. It jointly leverages D* Lite’s incremental graph search, shared attention-based map encoding, and dynamic environment modeling. Evaluated on the POGEMA benchmark, the approach achieves a 21.3% improvement in task success rate, a 36.7% reduction in collision rate, and an 18.9% increase in path efficiency. Real-world validation on EyeSim confirms robust scalability to hundred-agent scenarios and high-dynamic environments.

Technology Category

Application Category

📝 Abstract

Multi-Agent Pathfinding is used in areas including multi-robot formations, warehouse logistics, and intelligent vehicles. However, many environments are incomplete or frequently change, making it difficult for standard centralized planning or pure reinforcement learning to maintain both global solution quality and local flexibility. This paper introduces a hybrid framework that integrates D* Lite global search with multi-agent reinforcement learning, using a switching mechanism and a freeze-prevention strategy to handle dynamic conditions and crowded settings. We evaluate the framework in the discrete POGEMA environment and compare it with baseline methods. Experimental outcomes indicate that the proposed framework substantially improves success rate, collision rate, and path efficiency. The model is further tested on the EyeSim platform, where it maintains feasible Pathfinding under frequent changes and large-scale robot deployments.

Problem

Research questions and friction points this paper is trying to address.

Handles dynamic and incomplete environments for multi-agent pathfinding

Integrates global search with reinforcement learning for flexibility

Improves success rate and efficiency in crowded dynamic settings

Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid D* Lite and reinforcement learning framework

Switching mechanism for dynamic conditions

Freeze-prevention strategy in crowded settings

🔎 Similar Papers

Reevaluation of Large Neighborhood Search for MAPF: Findings and Opportunities