Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning

📅 2025-06-18
📈 Citations: 0
Influential: 0
🤖 AI Summary
This paper tackles three obstacles to robot task planning in real-world environments: goals that are unreachable as stated, the rigidity of classical planning, and the infeasible or unsafe plans that large language models (LLMs) can generate. It proposes a hierarchical task planning framework that integrates classical planning with LLMs. The core contribution is a semantics-driven, progressive goal relaxation mechanism: using LLM commonsense reasoning grounded in the semantic and geometric knowledge of a 3D scene graph, it iteratively relaxes the original goal into context-adapted, functionally equivalent, executable goals. The method combines PDDL-based symbolic planning, hierarchical task decomposition, and semantic grounding. In qualitative and quantitative evaluations on environments modeled as 3D Scene Graphs, it succeeds in complex scenarios where other benchmark methods are more likely to fail. Code, dataset, and additional material are publicly released.

📝 Abstract
Classical planning in AI and Robotics addresses complex tasks by shifting from imperative to declarative approaches (e.g., PDDL). However, these methods often fail in real scenarios due to limited robot perception and the need to ground perceptions to planning predicates. This often results in heavily hard-coded behaviors that struggle to adapt, even with scenarios where goals can be achieved through relaxed planning. Meanwhile, Large Language Models (LLMs) lead to planning systems that leverage commonsense reasoning but often at the cost of generating unfeasible and/or unsafe plans. To address these limitations, we present an approach integrating classical planning with LLMs, leveraging their ability to extract commonsense knowledge and ground actions. We propose a hierarchical formulation that enables robots to make unfeasible tasks tractable by defining functionally equivalent goals through gradual relaxation. This mechanism supports partial achievement of the intended objective, suited to the agent's specific context. Our method demonstrates its ability to adapt and execute tasks effectively within environments modeled using 3D Scene Graphs through comprehensive qualitative and quantitative evaluations. We also show how this method succeeds in complex scenarios where other benchmark methods are more likely to fail. Code, dataset, and additional material are released to the community.
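The gradual relaxation described in the abstract can be pictured as a search over progressively weaker goals. The following is a minimal sketch under stated assumptions, not the authors' implementation: `plan_with_relaxation`, `toy_planner`, and `toy_relax` are hypothetical names, the planner is a lookup table standing in for a PDDL solver, and the relaxation table stands in for an LLM proposing functionally equivalent goals.

```python
from typing import Callable, Iterable, Optional

Goal = str
Plan = list[str]


def plan_with_relaxation(
    goal: Goal,
    planner: Callable[[Goal], Optional[Plan]],
    relax: Callable[[Goal], Iterable[Goal]],
    max_depth: int = 3,
) -> Optional[tuple[Goal, Plan]]:
    """Try the original goal first, then goals relaxed one level at a time.

    Returns the first (goal, plan) pair the planner can solve, or None if
    no goal within `max_depth` relaxation steps is feasible.
    """
    frontier = [goal]
    seen = {goal}
    for _ in range(max_depth + 1):
        next_frontier = []
        for candidate in frontier:
            plan = planner(candidate)
            if plan is not None:
                return candidate, plan
            # Planning failed: collect weaker variants for the next level.
            for relaxed in relax(candidate):
                if relaxed not in seen:
                    seen.add(relaxed)
                    next_frontier.append(relaxed)
        if not next_frontier:
            break
        frontier = next_frontier
    return None


# Hypothetical stand-ins: a real system would call a PDDL planner and an LLM.
FEASIBLE = {"(on cup table)": ["pick cup", "place cup table"]}
RELAXATIONS = {"(in cup cupboard)": ["(on cup shelf)", "(on cup table)"]}


def toy_planner(goal: Goal) -> Optional[Plan]:
    return FEASIBLE.get(goal)


def toy_relax(goal: Goal) -> list[Goal]:
    return RELAXATIONS.get(goal, [])


result = plan_with_relaxation("(in cup cupboard)", toy_planner, toy_relax)
print(result)
```

In this toy run the original goal has no plan, so the loop falls back to the first relaxed goal that the planner can solve. Trying goals level by level keeps the result as close as possible to the original intent, which is one plausible reading of "gradual" relaxation.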
Problem

Research questions and friction points this paper is trying to address.

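Classical planning often fails in real scenarios because robot perception is limited and perceptions must be grounded in planning predicates.
LLM-based planners leverage commonsense reasoning but often generate unfeasible or unsafe plans.
Goals that are unfeasible as stated need a hierarchical formulation that gradually relaxes them into functionally equivalent, achievable ones.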
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates classical planning with LLMs
Uses hierarchical goal relaxation
Leverages 3D Scene Graphs
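To make the link between a 3D Scene Graph and a classical planner concrete, the sketch below turns a tiny graph into a PDDL `(:init ...)` block. It is an illustration under assumptions, not the paper's data model: the `SceneGraph` class, the `to_pddl_init` helper, and the predicate names (`room`, `furniture`, `item`, `in`, `on`) are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class SceneGraph:
    """Toy scene graph: typed nodes plus (subject, relation, object) edges."""

    nodes: dict[str, str] = field(default_factory=dict)
    edges: list[tuple[str, str, str]] = field(default_factory=list)

    def add_node(self, name: str, category: str) -> None:
        self.nodes[name] = category

    def add_edge(self, subject: str, relation: str, obj: str) -> None:
        self.edges.append((subject, relation, obj))


def to_pddl_init(graph: SceneGraph) -> str:
    """Render nodes as unary facts and edges as binary facts."""
    facts = [f"({category} {name})" for name, category in sorted(graph.nodes.items())]
    facts += [f"({rel} {subj} {obj})" for subj, rel, obj in graph.edges]
    return "(:init\n  " + "\n  ".join(facts) + "\n)"


graph = SceneGraph()
graph.add_node("kitchen", "room")
graph.add_node("table", "furniture")
graph.add_node("cup", "item")
graph.add_edge("table", "in", "kitchen")
graph.add_edge("cup", "on", "table")

init_block = to_pddl_init(graph)
print(init_block)
```

A real scene graph would also carry geometric attributes such as poses and sizes; they are omitted here to keep the grounding step readable.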
Emanuele Musumeci
Department of Computer, Automation and Management Engineering, University of Rome “La Sapienza”, 00181 RM Rome, Italy
Michele Brienza
Department of Computer, Automation and Management Engineering, University of Rome “La Sapienza”, 00181 RM Rome, Italy
Francesco Argenziano
Department of Computer, Automation and Management Engineering, University of Rome “La Sapienza”, 00181 RM Rome, Italy
Vincenzo Suriani
Sapienza University of Rome
Daniele Nardi
Sapienza Univ. Roma, Dept. Computer, Control and Management Engineering
Artificial Intelligence, Robotics, Multi Agent Systems
Domenico D. Bloisi
Associate Professor, International University of Rome UNINT
robot vision, intelligent surveillance, background subtraction, steganography, robot soccer