AI Summary
To address catastrophic forgetting in continual learning of deep neural networks, this paper proposes a novel parameter selection method based on activity analysis during the late-stage plateau phase of training. Unlike conventional approaches that monitor parameter dynamics throughout the entire training process, our method identifies highly active parameters during the plateau phase: these correspond to flat regions of the loss landscape, which facilitate joint optimization of old and new knowledge. We introduce a regularization mechanism that dynamically tracks parameter movement and variability, focusing evaluation on parameter adaptability after convergence. Experiments demonstrate that our approach significantly mitigates forgetting while simultaneously improving performance on new tasks, achieving a superior balance between forward and backward transfer accuracy. By leveraging interpretable, convergence-driven parameter activity, the method establishes a lightweight, efficient, and theoretically grounded paradigm for selective parameter adaptation in continual learning.
Abstract
Catastrophic forgetting in deep neural networks occurs when learning new tasks degrades performance on previously learned tasks due to knowledge overwriting. Among the approaches to mitigate this issue, regularization techniques aim to identify and constrain "important" parameters to preserve previous knowledge. In the highly nonconvex optimization landscape of deep learning, we propose a novel perspective: tracking parameters during the final training plateau is more effective than monitoring them throughout the entire training process. We argue that parameters that exhibit higher activity (movement and variability) during this plateau reveal directions in the loss landscape that are relatively flat, making them suitable for adaptation to new tasks while preserving knowledge from previous ones. Our comprehensive experiments demonstrate that this approach achieves superior performance in balancing catastrophic forgetting mitigation with strong performance on newly learned tasks.
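The abstract describes the selection criterion only in words: score each parameter by its activity (movement and variability) over checkpoints recorded during the final training plateau, then treat the most active parameters as safe directions for adapting to a new task. A minimal sketch of that criterion follows; the function names, the additive combination of movement and variability, and the top-fraction selection rule are illustrative assumptions, not the authors' exact formulation.

```python
import numpy as np

def plateau_activity(snapshots):
    """Score each parameter's activity during the training plateau.

    snapshots: array of shape (T, P) -- flattened parameter vectors
    recorded at T checkpoints after the loss has plateaued.

    Activity here is an assumed combination of movement (mean step
    magnitude between checkpoints) and variability (std of values).
    """
    snapshots = np.asarray(snapshots, dtype=float)
    movement = np.abs(np.diff(snapshots, axis=0)).mean(axis=0)
    variability = snapshots.std(axis=0)
    return movement + variability

def select_adaptable(snapshots, frac=0.2):
    """Return indices of the top-`frac` most active parameters --
    candidates for adaptation to the new task, with the remaining
    (inactive) parameters regularized to preserve old knowledge."""
    scores = plateau_activity(snapshots)
    k = max(1, int(frac * scores.size))
    return np.argsort(scores)[-k:]
```

In a training loop one would snapshot the model's parameters every few steps once the loss curve flattens, then pass the stacked snapshots to `select_adaptable`; parameters outside the returned index set would receive the regularization penalty when the next task is trained.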