Integrated Exploration and Sequential Manipulation on Scene Graph with LLM-based Situated Replanning

πŸ“… 2026-02-04
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenge of efficiently integrating exploration and task execution for robots operating in partially observable environments. The authors propose EPoG, a novel framework that uniquely combines large language models (LLMs) with a scene graph–based belief update mechanism. By continuously maintaining a belief graph that represents both known and unknown objects, the system generates goal-directed action sequences through graph-editing operations and leverages the LLM for local replanning. This approach enables seamless joint planning of exploration and manipulation, supporting long-horizon, multi-objective autonomous decision-making. Evaluated across 46 household scenarios and five complex task categories, EPoG achieves a success rate of 91.3% and reduces average travel distance by 36.1%, with real-robot experiments further demonstrating its effectiveness in dynamic, unknown environments.

Technology Category

Application Category

πŸ“ Abstract
In partially known environments, robots must combine exploration to gather information with task planning for efficient execution. To address this challenge, we propose EPoG, an Exploration-based sequential manipulation Planning framework on Scene Graphs. EPoG integrates a graph-based global planner with a Large Language Model (LLM)-based situated local planner, continuously updating a belief graph using observations and LLM predictions to represent known and unknown objects. Action sequences are generated by computing graph edit operations between the goal and belief graphs, ordered by temporal dependencies and movement costs. This approach seamlessly combines exploration and sequential manipulation planning. In ablation studies across 46 realistic household scenes and 5 long-horizon daily object transportation tasks, EPoG achieved a success rate of 91.3%, reducing travel distance by 36.1% on average. Furthermore, a physical mobile manipulator successfully executed complex tasks in unknown and dynamic environments, demonstrating EPoG's potential for real-world applications.
Problem

Research questions and friction points this paper is trying to address.

exploration
sequential manipulation
scene graph
task planning
partially known environments
Innovation

Methods, ideas, or system contributions that make the work stand out.

Scene Graph
Large Language Model (LLM)
Exploration-Planning Integration
Belief Graph
Sequential Manipulation
πŸ”Ž Similar Papers
No similar papers found.
H
Heqing Yang
Beihang University; State Key Laboratory of General Artificial Intelligence, BIGAI, Beijing, China
Ziyuan Jiao
Ziyuan Jiao
UCLA
RoboticsTask and Motion PlanningMobile ManipulationRobotic Manipulation
Shu Wang
Shu Wang
UCLA
RoboticsLanguage Models
Y
Yida Niu
State Key Laboratory of General Artificial Intelligence, BIGAI, Beijing, China; Institute for Artificial Intelligence, Peking University
Si Liu
Si Liu
Fred Hutchinson Cancer Center
GenomicsBiostatisticsAnomaly DetectionOpen Category Detection
Hangxin Liu
Hangxin Liu
Beijing Institute for General Artificial Intelligence (BIGAI)
RoboticsLocalizationSensors