Learning to Interact in World Latent for Team Coordination

📅 2025-09-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In multi-agent reinforcement learning (MARL), partial observability leads to incomplete information, while explicit communication suffers from high overhead and poor robustness. To address these challenges, this paper proposes Interactive World Latent Representations (IWoL). IWoL learns a shared, task-relevant latent space that implicitly encodes inter-agent relationships and environment dynamics, enabling efficient decentralized execution and implicit coordination; it also supports optional explicit communication via a unified interface. By jointly optimizing representation learning and implicit communication, IWoL uses the same latent encoding for both state representation and inter-agent information exchange. Evaluated on four standard MARL benchmarks—including StarCraft II and Multi-Agent MuJoCo—IWoL consistently outperforms state-of-the-art methods, demonstrating superior generalizability, adaptability to diverse tasks, and robustness under partial observability and communication constraints.

Technology Category

Application Category

📝 Abstract
This work presents a novel representation learning framework, interactive world latent (IWoL), to facilitate team coordination in multi-agent reinforcement learning (MARL). Building effective representation for team coordination is a challenging problem, due to the intricate dynamics emerging from multi-agent interaction and incomplete information induced by local observations. Our key insight is to construct a learnable representation space that jointly captures inter-agent relations and task-specific world information by directly modeling communication protocols. This representation, we maintain fully decentralized execution with implicit coordination, all while avoiding the inherent drawbacks of explicit message passing, e.g., slower decision-making, vulnerability to malicious attackers, and sensitivity to bandwidth constraints. In practice, our representation can be used not only as an implicit latent for each agent, but also as an explicit message for communication. Across four challenging MARL benchmarks, we evaluate both variants and show that IWoL provides a simple yet powerful key for team coordination. Moreover, we demonstrate that our representation can be combined with existing MARL algorithms to further enhance their performance.
Problem

Research questions and friction points this paper is trying to address.

Learning decentralized multi-agent coordination without explicit communication
Building representations capturing agent relations and world dynamics
Overcoming limitations of message passing in partial observability
Innovation

Methods, ideas, or system contributions that make the work stand out.

Learning interactive world latent for team coordination
Constructing representation space capturing agent relations
Enabling decentralized execution with implicit coordination
🔎 Similar Papers
No similar papers found.