🤖 AI Summary
This paper addresses a fundamental failure mode of the AIXI agent under the embedded-agency paradigm: because AIXI neglects its own physical embedding and causal coupling with the environment, it exhibits structural collapse, manifesting as self-referential inconsistency, resource-agnosticism, and model self-destruction.
Method: We formalize these embedding failure modes within universal AI theory, grounded in Solomonoff's prior and Bayesian decision theory. Using computability analysis and algorithmic information theory, we prove that such failures occur under universal-distribution assumptions. We further propose a distributional model over joint action/percept histories and construct a modified AIXI variant for theoretical analysis.
Contribution/Results: Our work gives a formal account of embedded-agency failure in universal AI and evaluates progress toward robust embedded AGI based on AIXI variants, delivering both a theoretical warning and a foundation for future work.
📝 Abstract
We rigorously discuss the commonly asserted failures of the AIXI reinforcement learning agent as a model of embedded agency. We attempt to formalize these failure modes and prove that they occur within the framework of universal artificial intelligence, focusing on a variant of AIXI that models the joint action/percept history as drawn from the universal distribution. We also evaluate the progress that has been made towards a successful theory of embedded agency based on variants of the AIXI agent.
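For context, the standard AIXI action rule (Hutter's formulation over chronological environments) and Solomonoff's universal distribution $M$ can be sketched as below. The variant discussed in the abstract applies a distribution of this universal kind to the *joint* action/percept history rather than to percepts alone; the exact conditioning is the paper's construction and is not reproduced here.

```latex
% Standard AIXI: pick the action maximizing expected total reward to
% horizon m, under the universal mixture over programs q for a
% universal (monotone) Turing machine U:
a_k \;=\; \arg\max_{a_k} \sum_{o_k r_k} \cdots \max_{a_m} \sum_{o_m r_m}
  \big[\, r_k + \cdots + r_m \,\big]
  \sum_{q \,:\, U(q,\, a_{1:m}) \,=\, o_{1:m} r_{1:m}} 2^{-\ell(q)}

% Solomonoff's universal distribution over finite strings x
% (programs p whose output starts with x), the kind of distribution
% the variant places over the joint action/percept history:
M(x) \;=\; \sum_{p \,:\, U(p) \,=\, x*} 2^{-\ell(p)}
```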