Online Set Learning from Precision and Recall Feedback

📅 2026-05-10

📈 Citations: 0

✨ Influential: 0

career value

240K/year

🤖 AI Summary

This work studies an online set learning framework in which, at each round, after predicting a subset of targets, the learner receives only partial feedback—either precision or recall—selected uniformly at random. The authors establish, for the first time, an equivalence between learnability and the finiteness of the VC dimension under this dependent feedback structure, demonstrating that standard empirical risk minimization fails in this setting. To address this challenge, they propose a novel online learning algorithm that integrates VC dimension theory with regret minimization techniques. The algorithm achieves sublinear regret bounds in both realizable and agnostic settings, thereby establishing fundamental learnability criteria for this partial-feedback model.

📝 Abstract

We consider the problem of learning an unknown subset $N_\text{target}$ of a domain in an online setting. In each round $t$, the learner predicts a set of items ${N}_t$ and receives one of two types of feedback, each with equal probability: precision feedback, in which a randomly chosen item from the predicted set $N_t$ is revealed and the learner is told whether it belongs to $N_\text{target}$ (incurring a reward if it does), or recall feedback, in which a randomly chosen item from the target set $N_\text{target}$ is revealed and the learner is told whether it belongs to $N_t$ (incurring a reward if it does). The goal is to maximize the cumulative reward over time. This simple online set learning problem abstracts a variety of learning scenarios with precision- and recall-type feedback. We show that a hypothesis class (a family of subsets of the domain) is learnable in this setting if and only if it has finite Vapnik-Chervonenkis (VC) dimension, mirroring the classical PAC characterization. However, the resulting algorithmic structure is markedly more intricate: in contrast to standard Probably Approximately Correct (PAC) learning -- where the algorithmic landscape is governed by the simple principle of Empirical Risk Minimization (ERM) -- our partial feedback model can invalidate ERM and even all proper learning rules. We develop algorithms to address the dependencies induced by the feedback, obtaining regret guarantees in both the realizable and agnostic settings. Our results provide a qualitative characterization of learnability in this model, addressing its most basic question, while pointing to a range of natural and intriguing open questions, including the determination of optimal regret rates.

Problem

Research questions and friction points this paper is trying to address.

online set learning

precision feedback

recall feedback

VC dimension

partial feedback

Innovation

Methods, ideas, or system contributions that make the work stand out.

online set learning

precision-recall feedback

VC dimension