MAD-PINN: A Decentralized Physics-Informed Machine Learning Framework for Safe and Optimal Multi-Agent Control

📅 2025-09-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Addressing the challenge of jointly optimizing safety and performance in large-scale multi-agent systems—where existing MARL, MPC, and safety-filtering approaches lack formal guarantees or suffer from poor scalability—this paper proposes a decentralized physics-informed machine learning framework. Methodologically, it (1) reformulates state-constrained optimal control as an unconstrained optimization via epigraph lifting; (2) integrates Hamilton–Jacobi reachability analysis into a dynamic neighbor selection mechanism to ensure provable safety and local adaptivity; and (3) employs a lightweight physics-informed neural network (PINN) to approximate the value function—trained on a reduced-order system and deployed fully distributively, relying solely on local observations for decision-making. Evaluated on multi-agent navigation tasks, the framework achieves superior safety-performance trade-offs, demonstrates strong scalability to hundreds of agents, and supports real-time execution.

Technology Category

Application Category

📝 Abstract
Co-optimizing safety and performance in large-scale multi-agent systems remains a fundamental challenge. Existing approaches based on multi-agent reinforcement learning (MARL), safety filtering, or Model Predictive Control (MPC) either lack strict safety guarantees, suffer from conservatism, or fail to scale effectively. We propose MAD-PINN, a decentralized physics-informed machine learning framework for solving the multi-agent state-constrained optimal control problem (MASC-OCP). Our method leverages an epigraph-based reformulation of SC-OCP to simultaneously capture performance and safety, and approximates its solution via a physics-informed neural network. Scalability is achieved by training the SC-OCP value function on reduced-agent systems and deploying them in a decentralized fashion, where each agent relies only on local observations of its neighbours for decision-making. To further enhance safety and efficiency, we introduce an Hamilton-Jacobi (HJ) reachability-based neighbour selection strategy to prioritize safety-critical interactions, and a receding-horizon policy execution scheme that adapts to dynamic interactions while reducing computational burden. Experiments on multi-agent navigation tasks demonstrate that MAD-PINN achieves superior safety-performance trade-offs, maintains scalability as the number of agents grows, and consistently outperforms state-of-the-art baselines.
Problem

Research questions and friction points this paper is trying to address.

Solving multi-agent state-constrained optimal control problems
Ensuring strict safety guarantees while optimizing performance
Achieving scalability in large-scale decentralized multi-agent systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

Decentralized physics-informed neural network for multi-agent control
Epigraph reformulation to jointly optimize safety and performance
Hamilton-Jacobi reachability-based neighbor selection for safety
🔎 Similar Papers
No similar papers found.
Manan Tayal
Manan Tayal
Indian Institute of Science (IISc), Bangalore
Safe ControlRobot LearningAIRobotics
A
Aditya Singh
Center for Cyber-Physical Systems, Indian Institute of Science, Bangalore, India
S
Shishir Kolathaya
Center for Cyber-Physical Systems, Indian Institute of Science, Bangalore, India
Somil Bansal
Somil Bansal
Assistant Professor, Stanford University
RoboticsArtificial intelligenceDynamic systems and control