Thresholds for sensitive optimality and Blackwell optimality in stochastic games

📅 2025-06-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper investigates strategy refinement for two-player zero-sum perfect-information stochastic games under the average-payoff criterion, focusing on characterizing the Blackwell optimality threshold α_Bw and the d-sensitive optimality threshold α_d. Addressing the limitations of existing bounds—namely, their conservatism and high computational complexity—we derive, for the first time, an effective upper bound on α_d for all d > −1, and significantly improve the upper bound on α_Bw. Methodologically, we innovatively integrate algebraic tools—including separation bounds for algebraic numbers, Lagrange’s bound, Mahler’s measure, and multiplicity theorems—to rigorously analyze the range of discount factors over which optimal strategies remain invariant. Our theoretical results provide formal guarantees for efficiently computing Blackwell-optimal and d-sensitive-optimal strategies, substantially reducing algorithmic complexity and advancing the practical feasibility of long-run discounted analysis in algorithmic game theory.

Technology Category

Application Category

📝 Abstract
We investigate refinements of the mean-payoff criterion in two-player zero-sum perfect-information stochastic games. A strategy is Blackwell optimal if it is optimal in the discounted game for all discount factors sufficiently close to $1$. The notion of $d$-sensitive optimality interpolates between mean-payoff optimality (corresponding to the case $d=-1$) and Blackwell optimality ($d=+infty$). The Blackwell threshold $α_{sf Bw} in [0,1[$ is the discount factor above which all optimal strategies in the discounted game are guaranteed to be Blackwell optimal. The $d$-sensitive threshold $α_{sf d} in [0,1[$ is defined analogously. Bounding $α_{sf Bw}$ and $α_{sf d}$ are fundamental problems in algorithmic game theory, since these thresholds control the complexity for computing Blackwell and $d$-sensitive optimal strategies, by reduction to discounted games which can be solved in $Oleft((1-α)^{-1} ight)$ iterations. We provide the first bounds on the $d$-sensitive threshold $α_{sf d}$ beyond the case $d=-1$, and we establish improved bounds for the Blackwell threshold $α_{sf Bw}$. This is achieved by leveraging separation bounds on algebraic numbers, relying on Lagrange bounds and more advanced techniques based on Mahler measures and multiplicity theorems.
Problem

Research questions and friction points this paper is trying to address.

Study Blackwell and d-sensitive optimality thresholds in stochastic games
Bound thresholds to compute optimal strategies efficiently
Use algebraic number theory to improve threshold bounds
Innovation

Methods, ideas, or system contributions that make the work stand out.

Blackwell optimality for discounted games
d-sensitive optimality interpolation method
algebraic number separation bounds
🔎 Similar Papers
No similar papers found.
S
Stéphane Gaubert
INRIA and CMAP, Ecole Polytechnique
Julien Grand-Clément
Julien Grand-Clément
HEC Paris
optimizationgame theoryhealthcarecomputer science
R
Ricardo D. Katz
CIFASIS-CONICET