Solving Imperfect-Recall Games via Sum-of-Squares Optimization

📅 2026-02-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the failure of conventional equilibrium computation methods in imperfect-recall extensive-form games due to information loss, proposing the first behavior-strategy-based Sum-of-Squares (SOS) optimization hierarchy. By leveraging the structure of information sets, the framework efficiently computes optimal strategies for single-player settings and Nash equilibria in multi-player games via semidefinite programming (SDP). The key contributions include the introduction of new classes of games—namely, (SOS)-concave and (SOS)-monotone games—and theoretical guarantees that the SOS hierarchy converges asymptotically in general cases and finitely in non-forgetful single-player games. For these newly defined game classes, a single SDP suffices to compute equilibria exactly, substantially enhancing both computational efficiency and applicability.

Technology Category

Application Category

📝 Abstract
Extensive-form games (EFGs) provide a powerful framework for modeling sequential decision making, capturing strategic interaction under imperfect information, chance events, and temporal structure. Most positive algorithmic and theoretical results for EFGs assume perfect recall, where players remember all past information and actions. We study the increasingly relevant setting of imperfect-recall EFGs (IREFGs), where players may forget parts of their history or previously acquired information, and where equilibrium computation is provably hard. We propose sum-of-squares (SOS) hierarchies for computing ex-ante optimal strategies in single-player IREFGs and Nash equilibria in multi-player IREFGs, working over behavioral strategies. Our theoretical results show that (i) these hierarchies converge asymptotically, (ii) under genericity assumptions, the convergence is finite, and (iii) in single-player non-absentminded IREFGs, convergence occurs at a finite level determined by the number of information sets. Finally, we introduce the new classes of (SOS)-concave and (SOS)-monotone IREFGs, and show that in the single-player setting the SOS hierarchy converges at the first level, enabling equilibrium computation with a single semidefinite program (SDP).
Problem

Research questions and friction points this paper is trying to address.

Imperfect-recall games
Extensive-form games
Equilibrium computation
Behavioral strategies
Nash equilibrium
Innovation

Methods, ideas, or system contributions that make the work stand out.

sum-of-squares optimization
imperfect-recall games
extensive-form games
behavioral strategies
semidefinite programming
🔎 Similar Papers
No similar papers found.
R
Rui Zheng
Singapore University of Technology and Design, Singapore
R
Ryann Sim
National University of Singapore, Singapore
Antonios Varvitsiotis
Antonios Varvitsiotis
Singapore University of Technology and Design
Mathematical OptimizationMachine LearningQuantum Computing