Triple-S: A Collaborative Multi-LLM Framework for Solving Long-Horizon Implicative Tasks in Robotics

📅 2025-08-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
LLMs frequently misconfigure API parameters, omit critical annotations, and violate execution ordering constraints when executing long-horizon implicit robotic tasks. To address these challenges, this paper proposes a multi-LLM collaborative closed-loop framework. Our method introduces: (1) a role-based, three-stage collaboration mechanism—Simplification (task decomposition and understanding), Solution (code generation), and Summary (result abstraction)—to decouple and specialize each functional phase; and (2) a success-driven dynamic demonstration library that enables generalized repair of failed tasks via context-aware, real-time demonstration updates. Integrating in-context learning with adaptive demonstration retrieval, our approach achieves 89% task success on the LDIP benchmark across both fully and partially observable settings. Extensive validation confirms robust performance in both simulation and real-world robotic platforms.

Technology Category

Application Category

📝 Abstract
Leveraging Large Language Models (LLMs) to write policy code for controlling robots has gained significant attention. However, in long-horizon implicative tasks, this approach often results in API parameter, comments and sequencing errors, leading to task failure. To address this problem, we propose a collaborative Triple-S framework that involves multiple LLMs. Through In-Context Learning, different LLMs assume specific roles in a closed-loop Simplification-Solution-Summary process, effectively improving success rates and robustness in long-horizon implicative tasks. Additionally, a novel demonstration library update mechanism which learned from success allows it to generalize to previously failed tasks. We validate the framework in the Long-horizon Desktop Implicative Placement (LDIP) dataset across various baseline models, where Triple-S successfully executes 89% of tasks in both observable and partially observable scenarios. Experiments in both simulation and real-world robot settings further validated the effectiveness of Triple-S. Our code and dataset is available at: https://github.com/Ghbbbbb/Triple-S.
Problem

Research questions and friction points this paper is trying to address.

Addresses API and sequencing errors in robot policy code
Improves success rates in long-horizon implicative robotics tasks
Enhances robustness through multi-LLM collaboration and learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Collaborative multi-LLM framework for robotics
Simplification-Solution-Summary closed-loop process
Demonstration library update from success
🔎 Similar Papers
No similar papers found.
Z
Zixi Jia
Faculty of Robot Science and Engineering, Northeastern University, Shenyang, 110819, China
Hongbin Gao
Hongbin Gao
Northeastern University
Large language modelrobotprompt learning
F
Fashe Li
SIASUN Robot Automation CO. Ltd., Shenyang, 110819, China
Jiqiang Liu
Jiqiang Liu
Professor of Computer Science
Information securityPrivacy preserving
H
Hexiao Li
Faculty of Robot Science and Engineering, Northeastern University, Shenyang, 110819, China
Qinghua Liu
Qinghua Liu
OpenAI
Machine LearningReinforcement LearningGame Theory