Forgetting is Everywhere

📅 2025-11-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Catastrophic forgetting—the loss of previously acquired knowledge when adapting to new data—remains a fundamental challenge for general-purpose learning algorithms. Method: We propose the first algorithm- and task-agnostic unified forgetting theory, defining forgetting as the learner’s failure to maintain self-consistency in its predictive distribution over future experiences, quantified via predictive information loss. This framework unifies forgetting analysis across classification, regression, generative modeling, and reinforcement learning. We derive a computationally tractable information retention metric grounded in this theory. Contribution/Results: Extensive cross-paradigm experiments validate the metric’s ability to accurately characterize forgetting dynamics. Crucially, forgetting severity exhibits a strong negative correlation with learning efficiency. Our theory thus provides both a rigorous foundation for understanding forgetting and a practical tool for designing and evaluating forgetting-mitigation algorithms.

Technology Category

Application Category

📝 Abstract
A fundamental challenge in developing general learning algorithms is their tendency to forget past knowledge when adapting to new data. Addressing this problem requires a principled understanding of forgetting; yet, despite decades of study, no unified definition has emerged that provides insights into the underlying dynamics of learning. We propose an algorithm- and task-agnostic theory that characterises forgetting as a lack of self-consistency in a learner's predictive distribution over future experiences, manifesting as a loss of predictive information. Our theory naturally yields a general measure of an algorithm's propensity to forget. To validate the theory, we design a comprehensive set of experiments that span classification, regression, generative modelling, and reinforcement learning. We empirically demonstrate how forgetting is present across all learning settings and plays a significant role in determining learning efficiency. Together, these results establish a principled understanding of forgetting and lay the foundation for analysing and improving the information retention capabilities of general learning algorithms.
Problem

Research questions and friction points this paper is trying to address.

Developing general learning algorithms that avoid forgetting past knowledge
Establishing unified theoretical framework to characterize forgetting dynamics
Measuring forgetting across diverse learning settings and algorithms
Innovation

Methods, ideas, or system contributions that make the work stand out.

Proposes algorithm-agnostic forgetting theory
Defines forgetting as predictive inconsistency
Introduces general measure for forgetting propensity
🔎 Similar Papers
B
Ben Sanati
University of Edinburgh, School of Informatics
T
Thomas L. Lee
University of Edinburgh, School of Informatics
T
Trevor A. McInroe
University of Edinburgh, School of Informatics
Aidan Scannell
Aidan Scannell
University of Edinburgh
machine learningsequential decision-makingrobotics
Nikolay Malkin
Nikolay Malkin
University of Edinburgh
David Abel
David Abel
DeepMind / University of Edinburgh
Reinforcement LearningArtificial IntelligencePhilosophy
A
A. Storkey
University of Edinburgh, School of Informatics