Exploration-Exploitation Tradeoff in Universal Lossy Compression

📅 2025-06-25

📈 Citations: 0

✨ Influential: 0

career value

206K/year

🤖 AI Summary

This work addresses the exploration-exploitation trade-off in sequential forward/backward adaptation for universal lossy compression, particularly under short codewords and non-stationary sources—where robustness remains a critical bottleneck. We introduce, for the first time, multi-armed bandit (MAB) modeling into the sequential adaptive process of lossy compression, proposing a cost-aware robust MAB algorithm that replaces conventional natural-type selection mechanisms. The method enables joint online learning and reconstruction optimization for arbitrary codeword lengths without requiring source priors. Theoretical analysis establishes its asymptotic optimality in non-stationary environments. Experiments demonstrate substantial improvements in both rate-distortion performance and reconstruction robustness for short-block scenarios, overcoming the limited few-shot adaptability inherent in existing approaches.

Technology Category

Application Category

📝 Abstract

Universal compression can learn the source and adapt to it either in a batch mode (forward adaptation), or in a sequential mode (backward adaptation). We recast the sequential mode as a multi-armed bandit problem, a fundamental model in reinforcement-learning, and study the trade-off between exploration and exploitation in the lossy compression case. We show that a previously proposed "natural type selection" scheme can be cast as a reconstruction-directed MAB algorithm, for sequential lossy compression, and explain its limitations in terms of robustness and short-block performance. We then derive and analyze robust cost-directed MAB algorithms, which work at any block length.

Problem

Research questions and friction points this paper is trying to address.

Explores exploration-exploitation tradeoff in lossy compression

Analyzes natural type selection as MAB algorithm limitations

Derives robust cost-directed MAB algorithms for any block length

Innovation

Methods, ideas, or system contributions that make the work stand out.

Recasts sequential mode as multi-armed bandit problem

Proposes reconstruction-directed MAB algorithm

Derives robust cost-directed MAB algorithms

🔎 Similar Papers

A Survey on Error-Bounded Lossy Compression for Scientific Datasets