DeReCo: Decoupling Representation and Coordination Learning for Object-Adaptive Decentralized Multi-Robot Cooperative Transport

📅 2026-03-09
🤖 AI Summary
This work addresses the mutual interference between object representation learning and coordination learning in decentralized multi-robot cooperative transport, which arises from partial observability and non-stationarity. To decouple the two, the paper proposes DeReCo, a three-stage training framework: first, a coordination policy is trained in a centralized setting with privileged object information; second, object-dependent representations are reconstructed from local observations; and third, privileged information is progressively removed to enable fully decentralized execution. According to the authors, this decoupling improves sample efficiency and generalization across objects with diverse shapes and physical properties. In the reported experiments, DeReCo outperforms baselines in simulation on three training objects, generalizes to six unseen objects with varying masses and friction coefficients, and achieves superior performance on two unseen objects in real-robot experiments.

📝 Abstract
Generalizing decentralized multi-robot cooperative transport across objects with diverse shapes and physical properties remains a fundamental challenge. Under decentralized execution, two key challenges arise: object-dependent representation learning under partial observability and coordination learning in multi-agent reinforcement learning (MARL) under non-stationarity. A typical approach jointly optimizes object-dependent representations and coordinated policies in an end-to-end manner while randomizing object shapes and physical properties during training. However, this joint optimization tightly couples representation and coordination learning, introducing bidirectional interference: inaccurate representations under partial observability destabilize coordination learning, while non-stationarity in MARL further degrades representation learning, resulting in sample-inefficient training. To address this structural coupling, we propose DeReCo, a novel MARL framework that decouples representation and coordination learning for object-adaptive multi-robot cooperative transport, improving sample efficiency and generalization across objects and transport scenarios. DeReCo adopts a three-stage training strategy: (1) centralized coordination learning with privileged object information, (2) reconstruction of object-dependent representations from local observations, and (3) progressive removal of privileged information for decentralized execution. This decoupling mitigates interference between representation and coordination learning and enables stable and sample-efficient training. Experimental results show that DeReCo outperforms baselines in simulation on three training objects, generalizes to six unseen objects with varying masses and friction coefficients, and achieves superior performance on two unseen objects in real-robot experiments.
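The third stage, progressive removal of privileged information, can be illustrated with a small sketch. The abstract does not state how the removal is scheduled, so everything here is an assumption made for illustration: the linear decay, the element-wise blend, and the names `privileged_weight` and `blend_object_features` are hypothetical and are not taken from the paper.

```python
from typing import List


def privileged_weight(step: int, total_steps: int) -> float:
    """Weight on privileged object features at a given training step.

    Assumed schedule: linear decay from 1.0 (fully privileged) to 0.0
    (fully decentralized). The paper may use a different schedule.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp progress to [0, 1] so steps past the end stay at weight 0.0.
    progress = min(max(step / total_steps, 0.0), 1.0)
    return 1.0 - progress


def blend_object_features(
    privileged: List[float],
    reconstructed: List[float],
    weight: float,
) -> List[float]:
    """Mix privileged features with features reconstructed from local observations.

    weight = 1.0 returns the privileged features unchanged;
    weight = 0.0 returns only the reconstructed features.
    """
    if len(privileged) != len(reconstructed):
        raise ValueError("feature vectors must have the same length")
    return [
        weight * p + (1.0 - weight) * r
        for p, r in zip(privileged, reconstructed)
    ]


if __name__ == "__main__":
    privileged = [1.0, 2.0]      # e.g. ground-truth object descriptors (illustrative)
    reconstructed = [3.0, 4.0]   # e.g. encoder output from local observations
    for step in (0, 50, 100):
        w = privileged_weight(step, total_steps=100)
        print(step, w, blend_object_features(privileged, reconstructed, w))
```

Under this assumed schedule, the policy input starts identical to the centralized setting and ends up depending only on what each robot can reconstruct locally, which is the condition required for decentralized execution.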
Problem

Research questions and friction points this paper is trying to address.

decentralized multi-robot cooperative transport
object generalization
representation learning
coordination learning
non-stationarity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Decoupling
Multi-Agent Reinforcement Learning
Object-Adaptive Transport
Sample Efficiency
Decentralized Coordination
Kazuki Shibata
Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology (NAIST), Nara, Japan
Ryosuke Sota
Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology (NAIST), Nara, Japan
Shandil Dhiresh Bosch
Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology (NAIST), Nara, Japan; Department of Cognitive Robotics, Faculty of Mechanical Engineering, Delft University of Technology, Delft, Netherlands
Yuki Kadokawa
Nara Institute of Science and Technology
Reinforcement Learning, Robotics, Sim-to-Real, Machine Learning
Tsurumine Yoshihisa
Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology (NAIST), Nara, Japan
Takamitsu Matsubara
Nara Institute of Science and Technology
Robot Learning, Machine Learning, Reinforcement Learning, Robotics