$O(sqrt{T})$ Static Regret and Instance Dependent Constraint Violation for Constrained Online Convex Optimization

📅 2025-02-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper studies Constrained Online Convex Optimization (COCO), aiming to jointly minimize static regret and cumulative constraint violation (CCV). For settings where convex loss and convex constraint functions are revealed each round, we propose a novel projection-based gradient algorithm. Our theoretical contributions are threefold: (1) achieving the optimal $O(sqrt{T})$ static regret; (2) introducing the first instance-dependent CCV upper bound $min{mathcal{V}, O(sqrt{T} log T)}$, where $mathcal{V}$ is governed by geometric properties of the constraint set—such as continuity distance, dimension, and diameter—and attains $O(1)$ under smooth or stable constraints, improving upon the prior universal $O(sqrt{T} log T)$ bound; and (3) enhancing tightness and practicality for time-varying resource constraints via dynamic constraint set modeling and geometric analysis. Experiments demonstrate superior empirical performance.

Technology Category

Application Category

📝 Abstract
The constrained version of the standard online convex optimization (OCO) framework, called COCO is considered, where on every round, a convex cost function and a convex constraint function are revealed to the learner after it chooses the action for that round. The objective is to simultaneously minimize the static regret and cumulative constraint violation (CCV). An algorithm is proposed that guarantees a static regret of $O(sqrt{T})$ and a CCV of $min{cV, O(sqrt{T}log T) }$, where $cV$ depends on the distance between the consecutively revealed constraint sets, the shape of constraint sets, dimension of action space and the diameter of the action space. For special cases of constraint sets, $cV=O(1)$. Compared to the state of the art results, static regret of $O(sqrt{T})$ and CCV of $O(sqrt{T}log T)$, that were universal, the new result on CCV is instance dependent, which is derived by exploiting the geometric properties of the constraint sets.
Problem

Research questions and friction points this paper is trying to address.

Minimizing static regret
Reducing constraint violation
Instance-dependent optimization algorithm
Innovation

Methods, ideas, or system contributions that make the work stand out.

Minimizes static regret
Reduces constraint violation
Exploits geometric properties
🔎 Similar Papers
No similar papers found.
Rahul Vaze
Rahul Vaze
Associate Professor, Electrical Engineering, Tata Institute of Fundamental Research, Mumbai, India.
Wireless CommunicationInformation TheoryStatistical Learning
A
Abhishek Sinha
School of Technology and Computer Science, Tata Institute of Fundamental Research, Mumbai 400005, India