Enabling Multi-Robot Collaboration from Single-Human Guidance

📅 2024-09-30
🏛️ arXiv.org
📈 Citations: 1
✨ Influential: 0
🤖 AI Summary
Multi-robot collaboration typically requires extensive expert demonstrations or joint reward design, both of which are costly and impractical for real-world deployment. Method: This paper proposes an explicit collaborative learning framework enabled by brief human guidance (40 minutes), featuring: (1) dynamic switching of controlled agents to emulate human role plasticity; (2) attention-based modeling of Theory of Mind (ToM) for inferring teammate intentions; and (3) integration of a hierarchical policy network with real-time role assignment. Contribution/Results: The framework removes the reliance on multi-agent demonstrations or shared reward signals, enabling explicit multi-agent collaboration learning from single-user supervision. Evaluated on a simulated cooperative hide-and-seek task, it improves success rate by up to 58% over baselines, and the findings transfer to experiments on a physical multi-robot platform.

πŸ“ Abstract
Learning collaborative behaviors is essential for multi-agent systems. Traditionally, multi-agent reinforcement learning solves this implicitly through a joint reward and centralized observations, assuming collaborative behavior will emerge. Other studies propose to learn from demonstrations of a group of collaborative experts. Instead, we propose an efficient and explicit way of learning collaborative behaviors in multi-agent systems by leveraging expertise from only a single human. Our insight is that humans can naturally take on various roles in a team. We show that agents can effectively learn to collaborate by allowing a human operator to dynamically switch between controlling agents for a short period and incorporating a human-like theory-of-mind model of teammates. Our experiments showed that our method improves the success rate of a challenging collaborative hide-and-seek task by up to 58% with only 40 minutes of human guidance. We further demonstrate our findings transfer to the real world by conducting multi-robot experiments.
Problem

Research questions and friction points this paper is trying to address.

Learning collaborative multi-agent behaviors
Single-human guidance in multi-robot systems
Improving task success with human-like theory-of-mind
Innovation

Methods, ideas, or system contributions that make the work stand out.

Single-human guidance
Dynamic role switching
Human-like theory-of-mind
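
The dynamic role switching idea can be illustrated with a minimal sketch. It assumes one human action per step, applied to whichever agent is currently controlled, while the remaining agents act through a placeholder policy. The names `SwitchingCollector`, `fallback_policy`, `seeker_0`, and `seeker_1` are hypothetical and do not come from the paper, and the theory-of-mind model is not represented here.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Observation = Tuple[float, ...]
Action = int


@dataclass
class SwitchingCollector:
    """Records (agent, observation, action) only for the human-controlled agent.

    Hypothetical helper for illustration; not the authors' implementation.
    """
    agent_ids: List[str]
    # Stand-in for whatever policy drives the agents the human is not controlling.
    fallback_policy: Callable[[str, Observation], Action]
    controlled: str = ""
    demos: List[Tuple[str, Observation, Action]] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Start by controlling the first agent unless one was given explicitly.
        if not self.controlled:
            self.controlled = self.agent_ids[0]

    def switch_to(self, agent_id: str) -> None:
        """Hand human control to a different agent."""
        if agent_id not in self.agent_ids:
            raise ValueError(f"unknown agent: {agent_id}")
        self.controlled = agent_id

    def step(self, observations: Dict[str, Observation],
             human_action: Action) -> Dict[str, Action]:
        """Return one action per agent and log the human's action as a demo."""
        actions: Dict[str, Action] = {}
        for agent_id in self.agent_ids:
            obs = observations[agent_id]
            if agent_id == self.controlled:
                actions[agent_id] = human_action
                self.demos.append((agent_id, obs, human_action))
            else:
                actions[agent_id] = self.fallback_policy(agent_id, obs)
        return actions


# Usage: the human controls seeker_0 for one step, then switches to seeker_1.
collector = SwitchingCollector(
    agent_ids=["seeker_0", "seeker_1"],
    fallback_policy=lambda agent_id, obs: 0,  # placeholder: always action 0
)
obs = {"seeker_0": (0.0, 1.0), "seeker_1": (2.0, 3.0)}
first = collector.step(obs, human_action=1)
collector.switch_to("seeker_1")
second = collector.step(obs, human_action=2)
```

After these two steps, `collector.demos` holds one demonstration per role, which is the kind of single-human, multi-role data the summary describes; how such data is then used for training is outside the scope of this sketch.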
Zhengran Ji
Duke University
Robotics, Reinforcement Learning, Multi-agent Systems, RLHF
Lingyu Zhang
Duke University
P. Sajda
Columbia University
Boyuan Chen
Duke University