Online Non-convex Optimization with Long-term Non-convex Constraints

📅 2023-11-04

📈 Citations: 0

✨ Influential: 0

career value

218K/year

🤖 AI Summary

This paper studies online non-convex optimization under adversarial environments, where both the objective function and long-term non-convex constraints are time-varying and coupled. To address this challenge, we propose a novel Follow-the-Perturbed-Leader (FPL) algorithm incorporating stochastic linear perturbations on primal variables and strongly concave perturbations on dual variables; each round invokes an offline oracle to compute a global minimax point. We establish, for the first time, an expected static regret analysis framework for such non-convex, constrained online settings, achieving an optimal $O(T^{8/9})$ sublinear cumulative regret bound—breaking both the conventional convexity assumption and short-term constraint restrictions. Empirical evaluation on river pollution source identification—featuring extreme-value constraints—demonstrates that our method significantly improves long-term constraint satisfaction and decision quality over state-of-the-art baselines, simultaneously attaining low regret and strong feasibility.

📝 Abstract

A novel Follow-the-Perturbed-Leader type algorithm is proposed and analyzed for solving general long-term constrained optimization problems in online manner, where the objective and constraints are arbitrarily generated and not necessarily convex. In each period, random linear perturbation and strongly concave perturbation are incorporated in primal and dual directions, respectively, to the offline oracle, and a global minimax point is searched as the solution. Based on a proposed expected static cumulative regret, we derive the first sublinear $O(T^{8/9})$ regret complexity for this class of problems. The proposed algorithm is applied to tackle a long-term (extreme value) constrained river pollutant source identification problem, validate the theoretical results and exhibit superior performance compared to existing methods.

Problem

Research questions and friction points this paper is trying to address.

Solves online non-convex optimization with adversarial constraints

Handles non-convexity via primal perturbations and dual regularizations

Achieves sublinear regret for long-term constrained identification problems

Innovation

Methods, ideas, or system contributions that make the work stand out.

Follow-the-Perturbed-Leader algorithm for non-convex optimization

Exponential random perturbations handle primal non-convexity

Logarithmic dual regularizations manage constraint violations

🔎 Similar Papers

Online Convex Optimisation: The Optimal Switching Regret for all Segmentations Simultaneously