🤖 AI Summary
Can general-purpose agents achieve flexible, multi-step, goal-directed behavior solely via model-free learning? This paper provides the first formal proof that world models are necessary for universal goal-directed generalization. We introduce a novel paradigm for *inverse extraction* of world models from trained policies—enabling high-fidelity model recovery without explicit model-based training. Our framework establishes quantitative relationships among goal complexity, policy performance, and world model accuracy, integrating tools from control theory, causal representation learning, and policy interpretability analysis. Key contributions include: (1) a necessity theorem proving that world models are indispensable for universal goal-directed generalization; (2) design principles for safe and controllable agents grounded in world model fidelity; (3) a characterization framework for environmental capability boundaries; and (4) a high-accuracy, model-free world model extraction algorithm.
📝 Abstract
Are world models a necessary ingredient for flexible, goal-directed behaviour, or is model-free learning sufficient? We provide a formal answer to this question, showing that any agent capable of generalizing to multi-step goal-directed tasks must have learned a predictive model of its environment. We show that this model can be extracted from the agent's policy, and that increasing the agent's performance, or the complexity of the goals it can achieve, requires learning increasingly accurate world models. This has a number of consequences, from developing safe and general agents to bounding agent capabilities in complex environments and providing new algorithms for eliciting world models from agents.
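To make the extraction claim concrete, here is a minimal toy sketch (not the paper's actual algorithm) of the underlying intuition: a goal-conditioned policy can be queried as a black box with simple reachability goals, and its action choices reveal the transition model it has implicitly learned. The MDP, the `policy` stand-in, and the assumption that the policy abstains on unreachable goals are all illustrative assumptions, not from the paper.

```python
# Toy sketch: eliciting an implicit world model from a goal-conditioned
# policy via black-box queries. All names and details are illustrative.

# Ground-truth deterministic transitions of a tiny 3-state MDP:
# T[(state, action)] = next_state
T = {
    (0, "a"): 1, (0, "b"): 2,
    (1, "a"): 0, (1, "b"): 2,
    (2, "a"): 2, (2, "b"): 0,
}

def policy(state, goal):
    """Stand-in for a trained goal-conditioned agent: returns an action
    it believes reaches `goal` in one step, or None if it judges the
    goal unreachable. Below it is only ever used as a black box."""
    for (s, a), s_next in T.items():
        if s == state and s_next == goal:
            return a
    return None

def extract_world_model(policy, states, goals):
    """Recover transitions from policy queries alone: if the agent
    chooses action `a` at state `s` to achieve goal `g`, it must
    believe that (s, a) leads to g."""
    model = {}
    for s in states:
        for g in goals:
            a = policy(s, g)
            if a is not None:
                model[(s, a)] = g
    return model

extracted = extract_world_model(policy, states=range(3), goals=range(3))
```

In this deterministic toy, the extracted model exactly matches the true transitions `T`, illustrating the paper's broader point that competent goal-directed behaviour leaks a recoverable world model.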