Accelerated Stochastic Min-Max Optimization Based on Bias-corrected Momentum

📅 2024-06-18

🏛️ arXiv.org

📈 Citations: 1

✨ Influential: 0

career value

208K/year

🤖 AI Summary

This work addresses nonconvex-strongly-concave (or nonconvex-PL) stochastic minimax optimization, aiming to break the existing $O(varepsilon^{-4})$ stochastic first-order oracle complexity barrier. We propose the first bias-corrected momentum framework tailored to this setting, which integrates efficient Hessian-vector product estimation with stochastic first-order updates—requiring neither second-order derivatives nor additional memory overhead. Theoretically, we establish an improved iteration complexity of $O(varepsilon^{-3})$, achieving the first acceleration result for this class of problems. Empirical evaluation on robust logistic regression confirms substantial improvements in convergence rate, with strong alignment between theoretical guarantees and practical performance.

Technology Category

Application Category

📝 Abstract

Lower-bound analyses for nonconvex strongly-concave minimax optimization problems have shown that stochastic first-order algorithms require at least $mathcal{O}(varepsilon^{-4})$ oracle complexity to find an $varepsilon$-stationary point. Some works indicate that this complexity can be improved to $mathcal{O}(varepsilon^{-3})$ when the loss gradient is Lipschitz continuous. The question of achieving enhanced convergence rates under distinct conditions, remains unresolved. In this work, we address this question for optimization problems that are nonconvex in the minimization variable and strongly concave or Polyak-Lojasiewicz (PL) in the maximization variable. We introduce novel bias-corrected momentum algorithms utilizing efficient Hessian-vector products. We establish convergence conditions and demonstrate a lower iteration complexity of $mathcal{O}(varepsilon^{-3})$ for the proposed algorithms. The effectiveness of the method is validated through applications to robust logistic regression using real-world datasets.

Problem

Research questions and friction points this paper is trying to address.

Improving convergence rates for nonconvex-strongly-concave minimax optimization

Reducing oracle complexity to O(ε⁻³) using bias-corrected momentum

Validating method via robust logistic regression on real-world datasets

Innovation

Methods, ideas, or system contributions that make the work stand out.

Bias-corrected momentum for min-max optimization

Utilizes efficient Hessian-vector products

Achieves lower iteration complexity O(ε⁻³)

🔎 Similar Papers

No similar papers found.