Continuous-Time Analysis of Heavy Ball Momentum in Min-Max Games

📅 2025-05-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work bridges a theoretical gap in understanding Heavy-Ball (HB) momentum in min-max optimization. Despite HB’s widespread use as a core component in modern min-max algorithms (e.g., Adam), its dynamical behavior remains poorly understood. We establish continuous-time models to systematically analyze the stability, convergence, and implicit regularization of HB under both simultaneous and alternating update schemes. Our theoretical analysis reveals a counterintuitive phenomenon: small momentum values—contrary to the conventional “large momentum accelerates convergence” intuition in minimization—broaden the stable step-size region and bias trajectories toward flatter solutions. Alternating updates further amplify this effect and induce preference for regions with shallower gradient slopes. We rigorously substantiate these findings via local stability analysis, derivation of implicit regularization, and comprehensive numerical experiments.

Technology Category

Application Category

📝 Abstract
Since Polyak's pioneering work, heavy ball (HB) momentum has been widely studied in minimization. However, its role in min-max games remains largely unexplored. As a key component of practical min-max algorithms like Adam, this gap limits their effectiveness. In this paper, we present a continuous-time analysis for HB with simultaneous and alternating update schemes in min-max games. Locally, we prove smaller momentum enhances algorithmic stability by enabling local convergence across a wider range of step sizes, with alternating updates generally converging faster. Globally, we study the implicit regularization of HB, and find smaller momentum guides algorithms trajectories towards shallower slope regions of the loss landscapes, with alternating updates amplifying this effect. Surprisingly, all these phenomena differ from those observed in minimization, where larger momentum yields similar effects. Our results reveal fundamental differences between HB in min-max games and minimization, and numerical experiments further validate our theoretical results.
Problem

Research questions and friction points this paper is trying to address.

Analyzes heavy ball momentum in min-max games
Explores local and global effects of momentum size
Compares min-max game dynamics with minimization scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Continuous-time analysis of HB momentum
Smaller momentum enhances algorithmic stability
Alternating updates amplify implicit regularization
🔎 Similar Papers
No similar papers found.
Y
Yi Feng
Shanghai University of Finance and Economics, Shanghai, China
Kaito Fujii
Kaito Fujii
National Institute of Informatics
Combinatorial OptimizationMachine Learning
Stratis Skoulakis
Stratis Skoulakis
Aarhus University
Online LearningAlgorithmic Game TheoryMachine LearningOptimizationAlgorithms
X
Xiao Wang
Key Laboratory of Interdisciplinary Research of Computation and Economics, China
V
V. Cevher
EPFL, Lausanne, Switzerland