Modeling Others' Minds as Code

📅 2025-09-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address key bottlenecks in human-robot collaboration—namely low prediction accuracy, strong data dependency, poor robustness, and limited adaptability—this paper proposes ROTE, a novel framework that models human social behavior as executable programs and treats interactive behaviors as programmable “scripts.” Leveraging large language models (LLMs), ROTE generates a structured hypothesis space of behavioral programs and integrates probabilistic inference to quantify uncertainty—thereby relaxing restrictive rationality assumptions and eliminating reliance on large-scale annotated datasets. Evaluated on grid-world tasks and a large-scale embodied home simulator, ROTE achieves high-precision human behavior prediction from sparse observations alone. It improves in-distribution accuracy and cross-scenario generalization by 50% over state-of-the-art baselines, significantly enhancing the robustness and safety of human-robot collaborative systems.

Technology Category

Application Category

📝 Abstract
Accurate prediction of human behavior is essential for robust and safe human-AI collaboration. However, existing approaches for modeling people are often data-hungry and brittle because they either make unrealistic assumptions about rationality or are too computationally demanding to adapt rapidly. Our key insight is that many everyday social interactions may follow predictable patterns; efficient "scripts" that minimize cognitive load for actors and observers, e.g., "wait for the green light, then go." We propose modeling these routines as behavioral programs instantiated in computer code rather than policies conditioned on beliefs and desires. We introduce ROTE, a novel algorithm that leverages both large language models (LLMs) for synthesizing a hypothesis space of behavioral programs, and probabilistic inference for reasoning about uncertainty over that space. We test ROTE in a suite of gridworld tasks and a large-scale embodied household simulator. ROTE predicts human and AI behaviors from sparse observations, outperforming competitive baselines -- including behavior cloning and LLM-based methods -- by as much as 50% in terms of in-sample accuracy and out-of-sample generalization. By treating action understanding as a program synthesis problem, ROTE opens a path for AI systems to efficiently and effectively predict human behavior in the real-world.
Problem

Research questions and friction points this paper is trying to address.

Predicting human behavior for safe human-AI collaboration
Overcoming data-hungry and brittle existing modeling approaches
Modeling social interactions as efficient behavioral programs
Innovation

Methods, ideas, or system contributions that make the work stand out.

Modeling human routines as behavioral programs
Leveraging LLMs for synthesizing behavioral programs
Using probabilistic inference for uncertainty reasoning
🔎 Similar Papers
No similar papers found.
K
Kunal Jha
Department of Computer Science, University of Washington, Seattle, WA
A
Aydan Yuenan Huang
Department of Computer Science, Johns Hopkins University, Baltimore, MD
E
Eric Ye
Department of Computer Science, University of Washington, Seattle, WA
Natasha Jaques
Natasha Jaques
University of Washington, Google Research
Social reinforcement learningMachine learningdeep learningmulti-agenthuman-AI interaction
Max Kleiman-Weiner
Max Kleiman-Weiner
University of Washington
Cognitive ScienceCooperationReinforcement LearningConsumer BehaviorArtificial Intelligence