BT-ACTION: A Test-Driven Approach for Modular Understanding of User Instruction Leveraging Behaviour Trees and LLMs

πŸ“… 2025-04-03
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Natural language instructions for kitchen-assistant robots are often abstract, ambiguous, or non-executable, leading to planning failures. Method: We propose the first test-driven BT-LLM co-design framework that tightly integrates large language models’ (LLMs) linguistic reasoning with behavior trees’ (BTs) interpretability, formal verifiability, and interruptibility. Guided by test-driven development (TDD), the LLM generates structured, verifiable subtask sequences, enabling a closed-loop pipeline of instruction understanding, action generation, and execution feedback. Contribution/Results: In a 45-participant user study, our approach significantly reduces execution error rates and improves user trust and preference. The implementation is open-sourced; empirical evaluation on real robotic hardware demonstrates strong effectiveness and robustness in practical kitchen scenarios.

Technology Category

Application Category

πŸ“ Abstract
Natural language instructions are often abstract and complex, requiring robots to execute multiple subtasks even for seemingly simple queries. For example, when a user asks a robot to prepare avocado toast, the task involves several sequential steps. Moreover, such instructions can be ambiguous or infeasible for the robot or may exceed the robot's existing knowledge. While Large Language Models (LLMs) offer strong language reasoning capabilities to handle these challenges, effectively integrating them into robotic systems remains a key challenge. To address this, we propose BT-ACTION, a test-driven approach that combines the modular structure of Behavior Trees (BT) with LLMs to generate coherent sequences of robot actions for following complex user instructions, specifically in the context of preparing recipes in a kitchen-assistance setting. We evaluated BT-ACTION in a comprehensive user study with 45 participants, comparing its performance to direct LLM prompting. Results demonstrate that the modular design of BT-ACTION helped the robot make fewer mistakes and increased user trust, and participants showed a significant preference for the robot leveraging BT-ACTION. The code is publicly available at https://github.com/1Eggbert7/BT_LLM.
Problem

Research questions and friction points this paper is trying to address.

Handling abstract complex natural language instructions for robots
Resolving ambiguity and feasibility in user instructions for robots
Integrating LLMs with robotic systems for coherent action sequences
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines Behavior Trees with LLMs
Test-driven modular action sequencing
Enhances robot accuracy and trust
πŸ”Ž Similar Papers
No similar papers found.