Bandits attack function optimization

📅 2026-05-05

📈 Citations: 0

✨ Influential: 0

career value

241K/year

🤖 AI Summary

This work addresses black-box function optimization under a limited budget of function evaluations by formulating it as a constrained sequential decision-making problem and proposes the Simultaneous Optimistic Optimization (SOO) algorithm. SOO integrates a deterministic partitioning strategy with a multi-armed bandit framework, leveraging an optimistic exploration mechanism to effectively balance global exploration and local exploitation—all without requiring gradient information. Experimental results on the CEC’2014 single-objective real-parameter benchmark suite demonstrate that SOO achieves competitive performance in both solution quality and numerical efficiency. The approach thus establishes a theoretically grounded and practically viable paradigm for gradient-free black-box optimization.

📝 Abstract

We consider function optimization as a sequential decision making problem under budget constraint. This constraint limits the number of objective function evaluations allowed during the optimization. We consider an algorithm inspired by a continuous version of a multi-armed bandit problem which attacks this optimization problem by solving the tradeoff between exploration (initial quasi-uniform search of the domain) and exploitation (local optimization around the potentially global maxima). We introduce the so-called Simultaneous Optimistic Optimization (SOO), a deterministic algorithm that works by domain partitioning. The benefit of such approach are the guarantees on the returned solution and the numerical efficiency of the algorithm. We present this machine learning approach to optimization, and provide the empirical assessment of SOO on the CEC'2014 competition on single objective real-parameter numerical optimization test-suite.

Problem

Research questions and friction points this paper is trying to address.

function optimization

budget constraint

exploration-exploitation tradeoff

multi-armed bandit

sequential decision making

Innovation

Methods, ideas, or system contributions that make the work stand out.

Simultaneous Optimistic Optimization

multi-armed bandit

function optimization