🤖 AI Summary
This work addresses the fundamental trade-off between information fidelity and compression ratio in text summarization by establishing the first information-theoretic rate-distortion framework for the task. We formally define the rate-distortion function of a summarizer, characterizing its intrinsic performance lower bound; propose a computable estimator of this lower bound together with a Blahut–Arimoto-type iterative optimization algorithm; and design a practical estimator tailored to low-resource settings. Empirical evaluation demonstrates that the derived theoretical lower bound correlates strongly with the actual performance of state-of-the-art summarization models, including BART, PEGASUS, and LLaMA-3, substantially exceeding the correlation achieved by conventional metrics such as ROUGE and BERTScore. This work provides an information-theoretic foundation for summary quality assessment and offers interpretable, principled optimization criteria for model design and compression.
📝 Abstract
This paper introduces an information-theoretic framework for text summarization. We define the summarizer rate-distortion function and show that it provides a fundamental lower bound on summarizer performance. We describe an iterative procedure, similar to the Blahut–Arimoto algorithm, for computing this function. To handle real-world text datasets, we also propose a practical method that estimates the summarizer rate-distortion function from limited data. Finally, we empirically confirm our theoretical results by comparing the summarizer rate-distortion function with the performance of summarizers used in practice.
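The paper's iterative procedure is not reproduced on this page, but the classical Blahut–Arimoto algorithm it resembles is standard. As a rough illustration of the idea, the sketch below computes one point on a discrete rate-distortion curve by alternating updates of the test channel and the reproduction marginal; the function name, the trade-off parameter `beta`, and the toy binary example are all illustrative choices, not the authors' setup.

```python
import numpy as np

def blahut_arimoto_rd(p_x, dist, beta, n_iter=500, tol=1e-10):
    """Compute one point (D, R) on a discrete rate-distortion curve.

    p_x:  source distribution over n symbols.
    dist: (n, m) distortion matrix d(x, y) between source and
          reproduction symbols.
    beta: trade-off parameter (magnitude of the slope of R(D)).
    Returns (distortion D, rate R in nats).
    """
    n, m = dist.shape
    q_y = np.full(m, 1.0 / m)  # reproduction marginal q(y), init uniform
    for _ in range(n_iter):
        # Optimal test channel q(y|x) for the current marginal q(y).
        log_w = np.log(q_y)[None, :] - beta * dist
        log_w -= log_w.max(axis=1, keepdims=True)  # numerical stability
        w = np.exp(log_w)
        q_y_given_x = w / w.sum(axis=1, keepdims=True)
        # Optimal marginal q(y) for the current channel q(y|x).
        q_y_new = p_x @ q_y_given_x
        if np.max(np.abs(q_y_new - q_y)) < tol:
            q_y = q_y_new
            break
        q_y = q_y_new
    joint = p_x[:, None] * q_y_given_x
    D = float(np.sum(joint * dist))
    ratio = np.where(joint > 0, q_y_given_x / q_y[None, :], 1.0)
    R = float(np.sum(joint * np.log(ratio)))
    return D, R

# Toy check: a uniform binary source under Hamming distortion, where
# the curve is known in closed form as R(D) = ln 2 - H_b(D) nats.
p = np.array([0.5, 0.5])
d = np.array([[0.0, 1.0], [1.0, 0.0]])
D, R = blahut_arimoto_rd(p, d, beta=2.0)
```

Each iteration is a coordinate-ascent step on the variational form of the rate-distortion function, so the procedure converges monotonically; sweeping `beta` traces out the full curve, with large `beta` favoring low distortion and small `beta` favoring low rate.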