SPOC: Safety-Aware Planning Under Partial Observability And Physical Constraints

πŸ“… 2026-02-25
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

233K/year
πŸ€– AI Summary
This work addresses the safety challenges in embodied task planning under partial observability and physical constraints, a domain where existing benchmarks lack systematic evaluation of plan feasibility and safety. We propose the first safety-oriented benchmark for embodied task planning, which uniquely integrates strict partial observability, both explicit and implicit physical constraints, and diverse household hazard scenarios into a unified evaluation framework. The benchmark introduces state- and constraint-based online metrics and incorporates a goal-conditioned, step-by-step planning mechanism to enable fine-grained assessment of large language models’ safety-aware planning capabilities. Experimental results reveal that current state-of-the-art models struggle to ensure safety under implicit constraints, highlighting their limitations in real-world deployment.

Technology Category

Application Category

πŸ“ Abstract
Embodied Task Planning with large language models faces safety challenges in real-world environments, where partial observability and physical constraints must be respected. Existing benchmarks often overlook these critical factors, limiting their ability to evaluate both feasibility and safety. We introduce SPOC, a benchmark for safety-aware embodied task planning, which integrates strict partial observability, physical constraints, step-by-step planning, and goal-condition-based evaluation. Covering diverse household hazards such as fire, fluid, injury, object damage, and pollution, SPOC enables rigorous assessment through both state and constraint-based online metrics. Experiments with state-of-the-art LLMs reveal that current models struggle to ensure safety-aware planning, particularly under implicit constraints. Code and dataset are available at https://github.com/khm159/SPOC
Problem

Research questions and friction points this paper is trying to address.

safety-aware planning
partial observability
physical constraints
embodied task planning
LLM safety
Innovation

Methods, ideas, or system contributions that make the work stand out.

Safety-Aware Planning
Partial Observability
Physical Constraints
Embodied Task Planning
LLM Benchmarking
πŸ”Ž Similar Papers
No similar papers found.