Bayesian Online Multiple Testing: A Resource Allocation Approach

📅 2024-02-18

📈 Citations: 0

✨ Influential: 0

career value

195K/year

🤖 AI Summary

This paper studies a novel online resource allocation problem where each accepted request must satisfy an *average* resource consumption constraint—rather than the conventional total budget constraint—and formalizes it as an online multiple hypothesis testing problem with exogenous budget updates, aiming to maximize discoveries while controlling the local false discovery rate (LFDR) in real time. We propose a new strategy incorporating an adaptive budget safety buffer; remarkably, only a logarithmic buffer size suffices to reduce regret from Ω(√T) or Ω(T) to O(log²T), revealing for the first time the fundamental failure of classical re-optimization heuristics in this setting. We prove the regret bound is tight (i.e., information-theoretically optimal). Extensive experiments on synthetic data and real-world NYC taxi trip time-series data demonstrate that our method significantly outperforms state-of-the-art baselines, achieving both statistical rigor and real-time decision efficiency.

Technology Category

Application Category

📝 Abstract

We consider the problem of sequentially conducting multiple experiments where each experiment corresponds to a hypothesis testing task. At each time point, the experimenter must make an irrevocable decision of whether to reject the null hypothesis (or equivalently claim a discovery) before the next experimental result arrives. The goal is to maximize the number of discoveries while maintaining a low error rate at all time points measured by Local False Discovery Rate (LFDR). We formulate the problem as an online knapsack problem with exogenous random budget replenishment. We start with general arrival distributions and show that a simple policy achieves a $O(sqrt{T})$ regret. We complement the result by showing that such regret rate is in general not improvable. We then shift our focus to discrete arrival distributions. We find that many existing re-solving heuristics in the online resource allocation literature, albeit achieve bounded loss in canonical settings, may incur a $Omega(sqrt{T})$ or even a $Omega(T)$ regret. With the observation that canonical policies tend to be too optimistic and over claim discoveries, we propose a novel policy that incorporates budget safety buffers. It turns out that a little more safety can greatly enhance efficiency -- small additional logarithmic buffers suffice to reduce the regret from $Omega(sqrt{T})$ or even $Omega(T)$ to $O(ln^2 T)$. From a practical perspective, we extend the policy to the scenario with continuous arrival distributions, time-dependent information structures, as well as unknown $T$. We conduct both synthetic experiments and empirical applications on a time series data from New York City taxi passengers to validate the performance of our proposed policies. Our results emphasize how effective policies should be designed in online resource allocation problems with exogenous budget replenishment.

Problem

Research questions and friction points this paper is trying to address.

Optimizing online resource allocation under average budget constraints per request

Developing efficient policies to minimize regret in sequential decision-making

Addressing limitations of existing heuristics through safety buffer mechanisms

Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses average budget constraints per accepted request

Introduces budget safety buffers to reduce regret

Extends policy to continuous arrivals and unknown T

🔎 Similar Papers

Simultaneous inference for generalized linear models with unmeasured confounders