🤖 AI Summary
This work addresses the profit maximization problem for centralized operators in Autonomous Mobility-on-Demand (AMoD) systems, focusing on the sequential joint decision-making of vehicle dispatching and dynamic rebalancing. We propose a vehicle-centric multi-agent reinforcement learning framework, introducing a novel global-action-aware critic loss function. Furthermore, we pioneer the integration of Soft Actor-Critic (SAC) with weighted bipartite graph matching to jointly optimize request-vehicle assignment and coordinated rebalancing. The proposed method improves dispatching performance by 12.9% over the current state-of-the-art (SOTA). When combined with the integrated rebalancing strategy, overall operational profit increases by 38.9%. These gains significantly enhance both the economic viability and scalability of AMoD systems.
📝 Abstract
We study a sequential decision-making problem for a profit-maximizing operator of an Autonomous Mobility-on-Demand system. Optimizing a central operator's vehicle-to-request dispatching policy requires efficient and effective fleet control strategies. To this end, we employ a multi-agent Soft Actor-Critic algorithm combined with weighted bipartite matching. We propose a novel vehicle-based algorithm architecture and adapt the critic's loss function to appropriately consider global actions. Furthermore, we extend our algorithm to incorporate rebalancing capabilities. Through numerical experiments, we show that our approach outperforms state-of-the-art benchmarks by up to 12.9% for dispatching and up to 38.9% with integrated rebalancing.