🤖 AI Summary
Reinforcement learning for multi-robot collaborative manipulation suffers from high data demand and strict Markovian assumptions, while existing Decision Transformers (DTs) lack hierarchical modeling and interpretability. Method: We propose a symbol-guided hierarchical decision framework—the first to integrate DTs into multi-robot manipulation. It employs a neuro-symbolic planner to generate interpretable symbolic subgoals for high-level semantic planning, and a goal-conditioned Decision Transformer (GCDT) to model long-horizon action sequences at the low level. The framework unifies causal Transformer architectures with neuro-symbolic reasoning to enable zero-shot and few-shot cross-task transfer. Contribution/Results: Experiments demonstrate significant improvements in deployment efficiency and generalization capability of multi-robot systems operating in complex, dynamic environments, validating the framework’s scalability, interpretability, and sample efficiency.
📝 Abstract
Reinforcement learning (RL) has demonstrated great potential in robotic operations. However, its data-intensive nature and reliance on the Markov Decision Process (MDP) assumption limit its practical deployment in real-world scenarios involving complex dynamics and long-term temporal dependencies, such as multi-robot manipulation. Decision Transformers (DTs) have emerged as a promising offline alternative by leveraging causal transformers for sequence modeling in RL tasks. However, their applications to multi-robot manipulations still remain underexplored. To address this gap, we propose a novel framework, Symbolically-Guided Decision Transformer (SGDT), which integrates a neuro-symbolic mechanism with a causal transformer to enable deployable multi-robot collaboration. In the proposed SGDT framework, a neuro-symbolic planner generates a high-level task-oriented plan composed of symbolic subgoals. Guided by these subgoals, a goal-conditioned decision transformer (GCDT) performs low-level sequential decision-making for multi-robot manipulation. This hierarchical architecture enables structured, interpretable, and generalizable decision making in complex multi-robot collaboration tasks. We evaluate the performance of SGDT across a range of task scenarios, including zero-shot and few-shot scenarios. To our knowledge, this is the first work to explore DT-based technology for multi-robot manipulation.