Formalizing Embeddedness Failures in Universal Artificial Intelligence

📅 2025-05-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses a fundamental failure mode of the AIXI agent under the embedded intelligence paradigm: its structural collapse—manifesting as self-referential inconsistency, resource-agnosticism, and model self-destruction—arising from neglecting its own physical embedding and causal coupling with the environment. Method: We provide the first axiomatic formalization of embedding failure in universal AI theory, grounded in Solomonoff’s prior and Bayesian decision theory. Using computability analysis and algorithmic information theory, we rigorously prove that such failure is inevitable under universal distribution assumptions. We further propose a novel distributional model over joint action-perception histories and construct a modified AIXI variant for theoretical validation. Contribution/Results: Our work establishes the first falsifiable benchmark for embedded intelligence failure, delivering both a critical theoretical warning and a formal foundation for developing robust embedded AGI systems.

Technology Category

Application Category

📝 Abstract
We rigorously discuss the commonly asserted failures of the AIXI reinforcement learning agent as a model of embedded agency. We attempt to formalize these failure modes and prove that they occur within the framework of universal artificial intelligence, focusing on a variant of AIXI that models the joint action/percept history as drawn from the universal distribution. We also evaluate the progress that has been made towards a successful theory of embedded agency based on variants of the AIXI agent.
Problem

Research questions and friction points this paper is trying to address.

Formalizing AIXI's failures in embedded agency
Proving failure modes in universal AI framework
Evaluating progress towards embedded agency theory
Innovation

Methods, ideas, or system contributions that make the work stand out.

Formalizing AIXI agent embeddedness failures rigorously
Proving failure modes in universal AI framework
Evaluating progress towards embedded agency theory
🔎 Similar Papers
No similar papers found.
C
Cole Wyeth
Cheriton School of Computer Science, University of Waterloo
Marcus Hutter
Marcus Hutter
Researcher@DeepMind & Professor at ANU
sciencephilosophyintelligenceinformationphysics