Bridging Values and Behavior: A Hierarchical Framework for Proactive Embodied Agents

2026-04-30
Citations: 0
Influential: 0

AI Summary
This work addresses a limitation of existing embodied agents: they often rely on passive responses and lack a high-level value mechanism to sustain autonomous behavior and resolve motivational conflicts. To bridge this gap, the authors propose ValuePlanner, a hierarchical cognitive architecture in which a large language model reasons about abstract value trade-offs and generates symbolic subgoals, which a PDDL planner then translates into executable plans. Combined with closed-loop feedback, the framework enables self-driven, coherent long-horizon behavior. The study offers a structured approach to linking intrinsic values to embodied actions and introduces an evaluation protocol centered on cumulative value gain, preference alignment, and behavioral diversity. In experiments in the TongSim household environment, the system arbitrates competing values and produces coherent, self-directed behavior that instruction-following and need-driven baselines do not exhibit.
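The loop described above (value reasoning, symbolic subgoal, planning, feedback) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the value names, the weighted-deficit rule standing in for the LLM, the canned plans standing in for a PDDL planner, and every identifier (`AgentState`, `SUBGOALS`, `choose_value`, `plan`, `run`) are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class AgentState:
    satisfaction: dict  # current satisfaction per value, in [0, 1] (assumed representation)
    weights: dict       # how strongly the agent prefers each value (assumed)
    failed: set = field(default_factory=set)    # values whose last plan failed
    history: list = field(default_factory=list)


# Hypothetical mapping: value -> (PDDL-style goal string, canned action plan).
SUBGOALS = {
    "health": ("(eaten agent meal)", ["goto kitchen", "cook meal", "eat meal"]),
    "order": ("(clean livingroom)", ["goto livingroom", "pick trash", "wipe table"]),
    "curiosity": ("(read agent book)", ["goto study", "open book", "read book"]),
}


def choose_value(state):
    """Stand-in for LLM value reasoning: pick the largest weighted deficit."""
    candidates = [v for v in state.weights if v not in state.failed]
    if not candidates:
        return None
    return max(candidates, key=lambda v: state.weights[v] * (1.0 - state.satisfaction[v]))


def plan(value):
    """Stand-in for a PDDL planner: return the goal and a canned action list."""
    goal, actions = SUBGOALS[value]
    return goal, list(actions)


def run(state, steps, blocked=frozenset()):
    """Run the value -> subgoal -> plan -> feedback loop for a fixed number of steps."""
    for _ in range(steps):
        value = choose_value(state)
        if value is None:
            break
        goal, actions = plan(value)
        succeeded = all(action not in blocked for action in actions)
        if succeeded:
            # Feedback on success: the pursued value becomes more satisfied.
            state.satisfaction[value] = min(1.0, state.satisfaction[value] + 0.5)
            state.history.append(value)
        else:
            # Feedback on failure: stop proposing this value so another one is tried.
            state.failed.add(value)
            state.history.append(value + ":failed")
    return state.history


if __name__ == "__main__":
    state = AgentState(
        satisfaction={"health": 0.2, "order": 0.5, "curiosity": 0.9},
        weights={"health": 1.0, "order": 0.8, "curiosity": 0.5},
    )
    print(run(state, steps=3))
```

In this toy version the agent alternates between values as their deficits change, and a blocked action (for example `blocked={"cook meal"}`) makes the feedback step redirect it to a different value. A real system would replace `choose_value` with an LLM call and `plan` with a PDDL solver.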
πŸ“ Abstract
Current embodied agents are often limited to passive instruction-following or reactive need-satisfaction, lacking a stable, high-order value framework essential for long-term, self-directed behavior and resolving motivational conflicts. We introduce ValuePlanner, a hierarchical cognitive architecture that decouples high-level value scheduling from low-level action execution. ValuePlanner employs an LLM-based cognitive module to generate symbolic subgoals by reasoning through abstract value trade-offs, which are then translated into executable action plans by a classical PDDL planner. This process is refined via a closed-loop feedback mechanism. Evaluating such autonomy requires methods beyond task-success rates, and we therefore propose a value-centric evaluation suite measuring cumulative value gain, preference alignment, and behavioral diversity. Experiments in the TongSim household environment demonstrate that ValuePlanner arbitrates competing values to generate coherent, long-horizon, self-directed behavior absent from instruction-following and needs-driven baselines. Our work offers a structured approach to bridging intrinsic values and grounded behavior for autonomous agents.
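The abstract names three evaluation quantities but this page does not give their formulas. The sketch below shows one plausible way each could be computed, purely as an assumption: cumulative value gain as a sum of per-step gains, preference alignment as cosine similarity between per-value gains and preference weights, and behavioral diversity as Shannon entropy over executed actions. The function names and the trace format are hypothetical and may differ from the paper's definitions.

```python
import math
from collections import Counter


def cumulative_value_gain(trace):
    """Sum all per-step value gains; `trace` is a list of {value: gain} dicts."""
    return sum(sum(step.values()) for step in trace)


def preference_alignment(trace, weights):
    """Cosine similarity between total gain per value and the preference weights."""
    totals = {v: sum(step.get(v, 0.0) for step in trace) for v in weights}
    dot = sum(totals[v] * weights[v] for v in weights)
    norm = math.sqrt(sum(t * t for t in totals.values())) * math.sqrt(
        sum(w * w for w in weights.values())
    )
    return dot / norm if norm else 0.0


def behavioral_diversity(actions):
    """Shannon entropy, in bits, of the empirical distribution over actions."""
    n = len(actions)
    if n == 0:
        return 0.0
    counts = Counter(actions)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


if __name__ == "__main__":
    trace = [{"health": 0.5}, {"order": 0.25, "health": 0.25}]
    weights = {"health": 1.0, "order": 0.8}
    print(cumulative_value_gain(trace))
    print(preference_alignment(trace, weights))
    print(behavioral_diversity(["cook meal", "eat meal", "wipe table", "cook meal"]))
```

Under these assumed definitions, an agent that repeats a single action scores zero diversity, and an agent whose gains are proportional to its preference weights scores an alignment close to 1.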
Problem

Research questions and friction points this paper is trying to address.

embodied agents
value framework
autonomous behavior
motivational conflicts
long-horizon planning
Innovation

Methods, ideas, or system contributions that make the work stand out.

ValuePlanner
hierarchical cognitive architecture
value-driven behavior
LLM-based reasoning
autonomous embodied agents
Authors

Chunhui Zhang
State Key Laboratory of General Artificial Intelligence, BIGAI
Yuxuan Wang
Peking University
Omni-LM, Multimodal Agent
Aoyang Qin
Tsinghua University
Yi-Long Lu
Peking University
decision making, problem solving, computational modeling
Kunlun Wu
State Key Laboratory of General Artificial Intelligence, BIGAI
Yizhou Wang
School of Computer Science, Peking University; Nat'l Eng. Research Center of Visual Technology, Peking University; Institute for AI, Peking University; State Key Laboratory of General Artificial Intelligence, Peking University
Wei Wang
State Key Laboratory of General Artificial Intelligence, BIGAI