Value Under Ignorance in Universal Artificial Intelligence

📅 2025-12-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the insufficient generalization capability of utility functions within the AIXI framework—particularly when the agent’s belief distribution includes incomplete hypotheses that predict only finite prefixes of history, leaving utility assignment theoretically unjustified. To resolve this, we model beliefs as imprecise probabilities, formalizing total ignorance via submeasures and defining generalized expected utility through the Choquet integral. This work constitutes the first integration of imprecise probability theory with universal AI value functions. We introduce the “ignorance-is-fatal” principle with a dual semantic interpretation: under the weak interpretation, the model recovers standard recursive utility; under the strong interpretation, we prove that expected utility cannot be represented as a Choquet integral, thereby rigorously establishing its computability boundary. The approach provides a theoretically grounded, robust framework for utility evaluation under deep uncertainty in universal artificial intelligence.

Technology Category

Application Category

📝 Abstract
We generalize the AIXI reinforcement learning agent to admit a wider class of utility functions. Assigning a utility to each possible interaction history forces us to confront the ambiguity that some hypotheses in the agent's belief distribution only predict a finite prefix of the history, which is sometimes interpreted as implying a chance of death equal to a quantity called the semimeasure loss. This death interpretation suggests one way to assign utilities to such history prefixes. We argue that it is as natural to view the belief distributions as imprecise probability distributions, with the semimeasure loss as total ignorance. This motivates us to consider the consequences of computing expected utilities with Choquet integrals from imprecise probability theory, including an investigation of their computability level. We recover the standard recursive value function as a special case. However, our most general expected utilities under the death interpretation cannot be characterized as such Choquet integrals.
Problem

Research questions and friction points this paper is trying to address.

Generalizes AIXI to handle broader utility functions in reinforcement learning.
Addresses ambiguity in utility assignment for finite history prefixes.
Investigates expected utilities using Choquet integrals from imprecise probability theory.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generalize AIXI agent for broader utility functions
Use Choquet integrals for imprecise probability distributions
Address semimeasure loss as total ignorance in computations
🔎 Similar Papers
No similar papers found.
C
Cole Wyeth
Cheriton School of Computer Science, University of Waterloo, 200 University Ave W, Waterloo, ON N2L 3G1, Canada
Marcus Hutter
Marcus Hutter
Researcher@DeepMind & Professor at ANU
sciencephilosophyintelligenceinformationphysics