A Rate-Distortion Framework for Summarization

📅 2025-01-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the fundamental trade-off between information fidelity and compression ratio in text summarization. We systematically introduce rate-distortion theory to summarization for the first time, establishing the first information-theoretic rate-distortion framework for this task. We formally define the rate-distortion function of a summarizer, characterizing its intrinsic performance lower bound; propose a computable theoretical lower bound estimator and a Blahut–Arimoto-type iterative optimization algorithm; and design a practical estimator tailored for low-resource settings. Empirical evaluation demonstrates that the derived theoretical lower bound strongly correlates with the actual performance of state-of-the-art summarization models—including BART, PEGASUS, and LLaMA-3—significantly outperforming conventional metrics (e.g., ROUGE, BERTScore) in correlation strength. This work provides an information-theoretic foundation for summary quality assessment and offers interpretable, principled optimization criteria for model design and compression.

Technology Category

Application Category

📝 Abstract
This paper introduces an information-theoretic framework for text summarization. We define the summarizer rate-distortion function and show that it provides a fundamental lower bound on summarizer performance. We describe an iterative procedure, similar to Blahut-Arimoto algorithm, for computing this function. To handle real-world text datasets, we also propose a practical method that can calculate the summarizer rate-distortion function with limited data. Finally, we empirically confirm our theoretical results by comparing the summarizer rate-distortion function with the performances of different summarizers used in practice.
Problem

Research questions and friction points this paper is trying to address.

Rate-Distortion Theory
Text Summarization
Information Loss
Innovation

Methods, ideas, or system contributions that make the work stand out.

rate-distortion framework
recursive Blahut-Arimoto algorithm
limited data computation method
🔎 Similar Papers
No similar papers found.
E
Enes Arda
INSPIRE@OhioState Research Center, Dept. of Electrical and Computer Engineering, The Ohio State University
Aylin Yener
Aylin Yener
Roy and Lois Chope Professor, The Ohio State University
green communicationsphysical layer securitysemantic communications6Ginformation theory