Rate-optimal Design for Anytime Best Arm Identification

πŸ“… 2025-10-27
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This paper addresses best-arm identification (BAI) in multi-armed bandits under a finite sampling budget, motivated by practical applications such as A/B testing. We propose the Almost Tracking algorithmβ€”a novel, anytime BAI method that achieves asymptotic optimality with respect to the H₁ risk measure. Unlike prior approaches, it requires no prespecified total budget, permits stopping at any time, and fully utilizes all collected samples without discarding any data. The algorithm employs a closed-form sampling rule coupled with an adaptive tracking mechanism to dynamically optimize cumulative sample allocation. Grounded in a min-max optimal framework, it ensures both theoretical rigor and computational efficiency. Experiments on synthetic and real-world datasets demonstrate that Almost Tracking significantly outperforms existing fixed-budget and anytime BAI methods, empirically validating its theoretical optimality and practical superiority.

Technology Category

Application Category

πŸ“ Abstract
We consider the best arm identification problem, where the goal is to identify the arm with the highest mean reward from a set of $K$ arms under a limited sampling budget. This problem models many practical scenarios such as A/B testing. We consider a class of algorithms for this problem, which is provably minimax optimal up to a constant factor. This idea is a generalization of existing works in fixed-budget best arm identification, which are limited to a particular choice of risk measures. Based on the framework, we propose Almost Tracking, a closed-form algorithm that has a provable guarantee on the popular risk measure $H_1$. Unlike existing algorithms, Almost Tracking does not require the total budget in advance nor does it need to discard a significant part of samples, which gives a practical advantage. Through experiments on synthetic and real-world datasets, we show that our algorithm outperforms existing anytime algorithms as well as fixed-budget algorithms.
Problem

Research questions and friction points this paper is trying to address.

Optimizing anytime best arm identification under limited sampling budget
Generalizing risk measures beyond fixed-budget approaches
Developing practical algorithms without requiring total budget knowledge
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generalizes fixed-budget best arm identification framework
Proposes closed-form Almost Tracking algorithm
No prior budget knowledge or sample discarding required
πŸ”Ž Similar Papers
No similar papers found.