Large language model-based task planning for service robots: A review

📅 2025-10-27
📈 Citations: 0
Influential: 0
📄 PDF

career value

217K/year
🤖 AI Summary
This study addresses the limited task planning capability of service robots in complex domestic environments. We propose a novel architecture centered on large language models (LLMs) as the cognitive engine. Methodologically, the approach integrates pretrained foundation models, domain-specific fine-tuning, retrieval-augmented generation (RAG), and multimodal prompt engineering to enable semantic understanding, high-level task decomposition, and autonomous decision-making from textual, visual, and auditory inputs. Our key contribution is an end-to-end LLM-driven planning framework that eliminates reliance on explicit symbolic modeling—characteristic of traditional planners—and systematically identifies current technical bottlenecks and evolutionary pathways. Experimental evaluation and comprehensive review demonstrate substantial improvements in task generalization and environmental adaptability. The work establishes a scalable technical paradigm for AI-robotics integration and delineates concrete directions for future advancement. (149 words)

Technology Category

Application Category

📝 Abstract
With the rapid advancement of large language models (LLMs) and robotics, service robots are increasingly becoming an integral part of daily life, offering a wide range of services in complex environments. To deliver these services intelligently and efficiently, robust and accurate task planning capabilities are essential. This paper presents a comprehensive overview of the integration of LLMs into service robotics, with a particular focus on their role in enhancing robotic task planning. First, the development and foundational techniques of LLMs, including pre-training, fine-tuning, retrieval-augmented generation (RAG), and prompt engineering, are reviewed. We then explore the application of LLMs as the cognitive core-`brain'-of service robots, discussing how LLMs contribute to improved autonomy and decision-making. Furthermore, recent advancements in LLM-driven task planning across various input modalities are analyzed, including text, visual, audio, and multimodal inputs. Finally, we summarize key challenges and limitations in current research and propose future directions to advance the task planning capabilities of service robots in complex, unstructured domestic environments. This review aims to serve as a valuable reference for researchers and practitioners in the fields of artificial intelligence and robotics.
Problem

Research questions and friction points this paper is trying to address.

Enhancing robotic task planning using large language models
Applying LLMs as cognitive core for service robots autonomy
Addressing task planning challenges in complex domestic environments
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-based cognitive core for robot decision-making
Multimodal input integration for task planning
Retrieval-augmented generation for enhanced planning
S
Shaohan Bian
School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, China
Y
Ying Zhang
School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, China; Engineering Research Center of Intelligent Control System and Intelligent Equipment, Ministry of Education, Yanshan University, Qinhuangdao 066004, China; Hebei Key Laboratory of Intelligent Rehabilitation and Neuromodulation, Yanshan University, Qinhuangdao 066004, China
G
Guohui Tian
School of Control Science and Engineering, Shandong University, Jinan 250061, China
Zhiqiang Miao
Zhiqiang Miao
Professor, Hunan University
Multi-Robot SystemsUnmanned SystemsCooperative ControlMotion Planning
E
Edmond Q. Wu
Department of Automation, Shanghai Jiao Tong University, Shanghai 200240, China
S
Simon X. Yang
Advanced Robotics and Intelligent Systems Laboratory, School of Engineering, University of Guelph, Guelph N1G 2W1, Canada
Changchun Hua
Changchun Hua
Yanshan University
control and systemsnetworked systemsteleoperation systems