Learning with Episodic Hypothesis Testing in General Games: A Framework for Equilibrium Selection

📅 2025-07-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses equilibrium selection in finite normal-form games by proposing a multi-agent learning dynamic grounded in statistical hypothesis testing, designed to converge to a Nash equilibrium that maximizes the minimum (transformed) utility across all players. Methodologically, agents periodically test hypotheses about opponents’ strategies, updating their own policies via empirical observations and belief resampling, while incorporating a utility-dependent exploration decay mechanism to jointly optimize belief refinement and exploration. The key contribution is the first integration of statistical hypothesis testing into game-theoretic learning—enabling endogenous selection of highly robust equilibria without external refinement criteria. Theoretically and empirically, the algorithm converges to an approximate Nash equilibrium set in general finite games and consistently favors solutions that improve the global minimum utility. This provides a novel, interpretable, and adaptive paradigm for equilibrium selection.

Technology Category

Application Category

📝 Abstract
We introduce a new hypothesis testing-based learning dynamics in which players update their strategies by combining hypothesis testing with utility-driven exploration. In this dynamics, each player forms beliefs about opponents' strategies and episodically tests these beliefs using empirical observations. Beliefs are resampled either when the hypothesis test is rejected or through exploration, where the probability of exploration decreases with the player's (transformed) utility. In general finite normal-form games, we show that the learning process converges to a set of approximate Nash equilibria and, more importantly, to a refinement that selects equilibria maximizing the minimum (transformed) utility across all players. Our result establishes convergence to equilibrium in general finite games and reveals a novel mechanism for equilibrium selection induced by the structure of the learning dynamics.
Problem

Research questions and friction points this paper is trying to address.

Develops hypothesis testing-based learning for equilibrium selection
Converges to approximate Nash equilibria in general games
Selects equilibria maximizing minimum utility across players
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hypothesis testing-based learning dynamics
Utility-driven exploration strategy updates
Convergence to refined Nash equilibria
🔎 Similar Papers
No similar papers found.
R
Ruifan Yang
School of Operations Research and Information Engineering, Cornell University
Manxi Wu
Manxi Wu
University of California, Berkeley
Game TheoryMulti-agent LearningMechanism DesignSocietal Network