Partial Feedback Online Learning

📅 2026-01-29

📈 Citations: 0

✨ Influential: 0

career value

220K/year

🤖 AI Summary

This work investigates online learning with partial feedback, where only a single correct label is observed per round despite the existence of multiple valid labels. Under the set realizability assumption, the authors introduce two new complexity measures—the Partial-Feedback Littlestone dimension (PFLdim) and the Partial-Feedback Measure Shattering dimension (PMSdim)—which precisely characterize the minimax regret for deterministic and randomized learners, respectively, and reveal their indistinguishability under broad conditions. Tight regret bounds are established via set-valued version spaces, auxiliary dimension constructions, and combinatorial information-theoretic tools such as Helly numbers and nested structures. Furthermore, the study demonstrates that the problem may become information-theoretically unsolvable in the non-realizable setting, thereby underscoring the necessity of the proposed dimensions and resolving a key open question in set-valued online learning.

Technology Category

Application Category

📝 Abstract

We study a new learning protocol, termed partial-feedback online learning, where each instance admits a set of acceptable labels, but the learner observes only one acceptable label per round. We highlight that, while classical version space is widely used for online learnability, it does not directly extend to this setting. We address this obstacle by introducing a collection version space, which maintains sets of hypotheses rather than individual hypotheses. Using this tool, we obtain a tight characterization of learnability in the set-realizable regime. In particular, we define the Partial-Feedback Littlestone dimension (PFLdim) and the Partial-Feedback Measure Shattering dimension (PMSdim), and show that they tightly characterize the minimax regret for deterministic and randomized learners, respectively. We further identify a nested inclusion condition under which deterministic and randomized learnability coincide, resolving an open question of Raman et al. (2024b). Finally, given a hypothesis space H, we show that beyond set realizability, the minimax regret can be linear even when |H|=2, highlighting a barrier beyond set realizability.

Problem

Research questions and friction points this paper is trying to address.

partial feedback

online learning

minimax regret

set realizability

learnability

Innovation

Methods, ideas, or system contributions that make the work stand out.

Partial-Feedback Littlestone dimension

Minimax regret

Collection version space