When and Why is Optimistic Multiplicative Weights Slow? The Geometry of Energy Dissipation

📅 2026-05-13

📈 Citations: 0

✨ Influential: 0

career value

186K/year

🤖 AI Summary

This work investigates the geometric origins of the slow last-iterate convergence of the Optimistic Multiplicative Weights Update (OMWU) algorithm in two-player zero-sum games. By interpreting OMWU’s dual iterates as optimistic mirror descent on an energy function, the authors develop a novel analysis framework based on energy dissipation, which reveals a geometric bottleneck arising when primal iterates approach the boundary of the simplex. This framework provides the first quantitative characterization of the slow convergence mechanism and yields a linear last-iterate convergence rate with sharp dependence on the game constants. The study further establishes that convergence rates are not transferable across different distance metrics and improves the optimal duality gap rate to Õ(T⁻¹/²) in 2×2 games, matching the theoretical lower bound.

📝 Abstract

This paper studies the convergence of the Optimistic Multiplicative Weights Update algorithm (OMWU) in two player zero-sum games. Recent works have identified instances on which the last-iterate of OMWU can converge arbitrarily slowly, but understanding when and why this slow convergence occurs has remained open. In this work, we develop a new analysis framework that gives sharp, quantitative explanations for this behavior. Our analysis is based on viewing the algorithm's dual iterates as an optimistic skew-gradient descent with respect to an energy function. We prove over the dual iterates that energy is dissipative, and by establishing tight bounds on the magnitude of dissipation, our analysis quantifies the geometric bottlenecks that arise when the corresponding primal iterates are close to the simplex boundary. This further translates into a new linear last-iterate convergence rate in KL divergence on games with a unique and interior Nash equilibrium. Compared to prior work, this new rate contains a much sharper dependence on game-specific constants, and we prove this dependence is optimal. Moreover, these geometric insights further translate into new separations on uniform convergence rates for OMWU. On the one hand, we prove constant lower bounds on the uniform best-iterate convergence rate in KL divergence and total variation distance from Nash. On the other hand, we establish for the $2\times 2$ setting a new ${\widetilde O}(T^{-1/2})$ best-iterate rate in duality gap, improving substantially over prior work. Together, this shows in general that uniform convergence rate guarantees do not transfer across different measures of distance to Nash.

Problem

Research questions and friction points this paper is trying to address.

Optimistic Multiplicative Weights Update

convergence rate

zero-sum games

last-iterate convergence

Nash equilibrium

Innovation

Methods, ideas, or system contributions that make the work stand out.

Optimistic Multiplicative Weights Update

energy dissipation

last-iterate convergence