Identifying All ε-Best Arms in (Misspecified) Linear Bandits

📅 2025-09-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In high-cost exploration settings—such as drug discovery—efficiently identifying all ε-optimal arms in linear stochastic bandits is critical yet challenging. Method: We propose LinFACT, the first algorithm for ε-optimal arm identification in linear bandits that achieves instance optimality. It establishes a novel information-theoretic lower bound on sample complexity under the linear bandit model and integrates this bound into an adaptive sampling framework. The resulting algorithm yields a matching upper bound on sample complexity, accommodates model misspecification, and extends naturally to generalized linear models. Results: LinFACT’s termination time is governed by an information bottleneck, achieving sample complexity optimal up to logarithmic factors. Experiments on synthetic benchmarks and real-world drug discovery datasets demonstrate that LinFACT identifies more high-quality candidates at significantly lower sampling cost, substantially improving early-stage exploration efficiency.

Technology Category

Application Category

📝 Abstract
Motivated by the need to efficiently identify multiple candidates in high trial-and-error cost tasks such as drug discovery, we propose a near-optimal algorithm to identify all ε-best arms (i.e., those at most ε worse than the optimum). Specifically, we introduce LinFACT, an algorithm designed to optimize the identification of all ε-best arms in linear bandits. We establish a novel information-theoretic lower bound on the sample complexity of this problem and demonstrate that LinFACT achieves instance optimality by matching this lower bound up to a logarithmic factor. A key ingredient of our proof is to integrate the lower bound directly into the scaling process for upper bound derivation, determining the termination round and thus the sample complexity. We also extend our analysis to settings with model misspecification and generalized linear models. Numerical experiments, including synthetic and real drug discovery data, demonstrate that LinFACT identifies more promising candidates with reduced sample complexity, offering significant computational efficiency and accelerating early-stage exploratory experiments.
Problem

Research questions and friction points this paper is trying to address.

Identifying near-optimal arms in linear bandits efficiently
Optimizing sample complexity for multi-candidate selection tasks
Addressing model misspecification in generalized linear bandits
Innovation

Methods, ideas, or system contributions that make the work stand out.

LinFACT algorithm identifies near-optimal arms efficiently
Matches information-theoretic lower bound for sample complexity
Extends analysis to misspecified and generalized linear models
🔎 Similar Papers
No similar papers found.
Zhekai Li
Zhekai Li
Msc in The Chinese University of Hong Kong, Shenzhen
speech synthesissinging voice synthesis
T
Tianyi Ma
School of Operations Research and Information Engineering, Cornell University
C
Cheng Hua
Antai College of Economics and Management, Shanghai Jiao Tong University
Ruihao Zhu
Ruihao Zhu
Cornell University
Business AnalyticsRevenue ManagementSupply Chain