🤖 AI Summary
This work bridges a theoretical gap in understanding Heavy-Ball (HB) momentum in min-max optimization. Despite HB’s widespread use as a core component in modern min-max algorithms (e.g., Adam), its dynamical behavior remains poorly understood. We establish continuous-time models to systematically analyze the stability, convergence, and implicit regularization of HB under both simultaneous and alternating update schemes. Our theoretical analysis reveals a counterintuitive phenomenon: small momentum values—contrary to the conventional “large momentum accelerates convergence” intuition in minimization—broaden the stable step-size region and bias trajectories toward flatter solutions. Alternating updates further amplify this effect and induce preference for regions with shallower gradient slopes. We rigorously substantiate these findings via local stability analysis, derivation of implicit regularization, and comprehensive numerical experiments.
📝 Abstract
Since Polyak's pioneering work, heavy ball (HB) momentum has been widely studied in minimization. However, its role in min-max games remains largely unexplored. As a key component of practical min-max algorithms like Adam, this gap limits their effectiveness. In this paper, we present a continuous-time analysis for HB with simultaneous and alternating update schemes in min-max games. Locally, we prove smaller momentum enhances algorithmic stability by enabling local convergence across a wider range of step sizes, with alternating updates generally converging faster. Globally, we study the implicit regularization of HB, and find smaller momentum guides algorithms trajectories towards shallower slope regions of the loss landscapes, with alternating updates amplifying this effect. Surprisingly, all these phenomena differ from those observed in minimization, where larger momentum yields similar effects. Our results reveal fundamental differences between HB in min-max games and minimization, and numerical experiments further validate our theoretical results.