SPOC: Safety-Aware Planning Under Partial Observability And Physical Constraints

πŸ“… 2026-02-25
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the safety challenges in embodied task planning under partial observability and physical constraints, a domain where existing benchmarks lack systematic evaluation of plan feasibility and safety. We propose the first safety-oriented benchmark for embodied task planning, which uniquely integrates strict partial observability, both explicit and implicit physical constraints, and diverse household hazard scenarios into a unified evaluation framework. The benchmark introduces state- and constraint-based online metrics and incorporates a goal-conditioned, step-by-step planning mechanism to enable fine-grained assessment of large language models’ safety-aware planning capabilities. Experimental results reveal that current state-of-the-art models struggle to ensure safety under implicit constraints, highlighting their limitations in real-world deployment.

Technology Category

Application Category

πŸ“ Abstract
Embodied Task Planning with large language models faces safety challenges in real-world environments, where partial observability and physical constraints must be respected. Existing benchmarks often overlook these critical factors, limiting their ability to evaluate both feasibility and safety. We introduce SPOC, a benchmark for safety-aware embodied task planning, which integrates strict partial observability, physical constraints, step-by-step planning, and goal-condition-based evaluation. Covering diverse household hazards such as fire, fluid, injury, object damage, and pollution, SPOC enables rigorous assessment through both state and constraint-based online metrics. Experiments with state-of-the-art LLMs reveal that current models struggle to ensure safety-aware planning, particularly under implicit constraints. Code and dataset are available at https://github.com/khm159/SPOC
Problem

Research questions and friction points this paper is trying to address.

safety-aware planning
partial observability
physical constraints
embodied task planning
LLM safety
Innovation

Methods, ideas, or system contributions that make the work stand out.

Safety-Aware Planning
Partial Observability
Physical Constraints
Embodied Task Planning
LLM Benchmarking
πŸ”Ž Similar Papers
No similar papers found.
Hyungmin Kim
Hyungmin Kim
Korea Institute of Science and Technology
Focused UltrasoundImage-Guided TherapyBrain-Machine Interface
H
Hobeom Jeon
ETRI School, University of Science and Technology, South Korea
D
Dohyung Kim
ETRI School, University of Science and Technology, South Korea; Social Robotics Laboratory, Electronics and Telecommunication Research Institute, South Korea
Minsu Jang
Minsu Jang
Assistant Project Scientist, University of California, Irvine
Jaehong Kim
Jaehong Kim
Yale University
Environmental EngineeringWater TreatmentEnvironmental NanotechnologyDisinfectionMembrane