AI Summary
This paper addresses best arm identification (BAI) in multi-armed bandits under a finite sampling budget, motivated by practical applications such as A/B testing. It proposes Almost Tracking, a novel anytime BAI algorithm with a provable guarantee on the popular $H_1$ risk measure. Unlike prior approaches, it requires no prespecified total budget, permits stopping at any time, and uses all collected samples without discarding data. The algorithm couples a closed-form sampling rule with an adaptive tracking mechanism that dynamically steers the cumulative sample allocation. Grounded in a framework that is provably minimax optimal up to a constant factor, it offers both theoretical soundness and computational efficiency. Experiments on synthetic and real-world datasets show that Almost Tracking outperforms existing fixed-budget and anytime BAI methods, empirically supporting its theoretical guarantees.
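The summary refers to the $H_1$ risk measure without defining it. Assuming the paper follows the conventional fixed-budget BAI definition (this is an assumption; the paper may use a variant), $H_1$ is the standard hardness quantity built from the suboptimality gaps:

```latex
H_1 = \sum_{i=1}^{K} \frac{1}{\Delta_i^{2}},
\qquad
\Delta_i =
\begin{cases}
\mu^{*} - \mu_i & \text{if } i \neq i^{*},\\[2pt]
\min_{j \neq i^{*}} \left( \mu^{*} - \mu_j \right) & \text{if } i = i^{*},
\end{cases}
```

where $i^{*}$ is the best arm and $\mu^{*}$ its mean reward; smaller gaps make $H_1$ larger, reflecting a harder identification problem.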
Abstract
We consider the best arm identification problem, where the goal is to identify the arm with the highest mean reward from a set of $K$ arms under a limited sampling budget. This problem models many practical scenarios such as A/B testing. We consider a class of algorithms for this problem that is provably minimax optimal up to a constant factor. This framework generalizes existing work on fixed-budget best arm identification, which is limited to particular choices of risk measures. Based on the framework, we propose Almost Tracking, a closed-form algorithm with a provable guarantee on the popular risk measure $H_1$. Unlike existing algorithms, Almost Tracking requires neither the total budget in advance nor discarding a significant part of the samples, which gives it a practical advantage. Through experiments on synthetic and real-world datasets, we show that our algorithm outperforms existing anytime algorithms as well as fixed-budget algorithms.
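The abstract describes a closed-form sampling rule with adaptive tracking, but does not reproduce the rule itself. As a generic illustration of the tracking idea common to this family of algorithms (pull the arm whose empirical count lags furthest behind a target allocation, so the rule is anytime and uses every sample), here is a minimal Python sketch. The function name, the uniform default weights, and the forced-exploration round are all illustrative assumptions, not the paper's Almost Tracking algorithm.

```python
def anytime_tracking_bai(pull, K, horizon, weights=None):
    """Illustrative anytime tracking-style allocation for BAI.

    NOTE: this is a generic sketch of the tracking idea, NOT the paper's
    Almost Tracking rule. `pull(i)` returns a (possibly stochastic) reward
    for arm i; `weights` is a target allocation proportion over the K arms
    (uniform by default). Assumes horizon >= K.
    """
    if weights is None:
        weights = [1.0 / K] * K
    counts = [0] * K      # number of pulls of each arm
    sums = [0.0] * K      # cumulative reward of each arm
    for t in range(horizon):
        if t < K:
            i = t  # forced exploration: pull every arm once
        else:
            # tracking step: pull the arm most under-sampled
            # relative to its target share weights[a] * t
            i = min(range(K), key=lambda a: counts[a] - weights[a] * t)
        sums[i] += pull(i)
        counts[i] += 1
    means = [sums[a] / counts[a] for a in range(K)]
    return max(range(K), key=lambda a: means[a])  # recommend empirical best
```

Because the loop can be stopped after any step and the recommendation uses all samples gathered so far, this style of rule is anytime in the sense the abstract describes; the paper's contribution lies in choosing the allocation so that the resulting error guarantee is minimax optimal for $H_1$.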