JumpLoRA: Sparse Adapters for Continual Learning in Large Language Models

📅 2026-04-17
📈 Citations: 0
Influential: 0
📄 PDF

career value

205K/year
🤖 AI Summary
This work addresses the catastrophic forgetting problem in large language models during continual learning, which arises from sequential task acquisition. To mitigate interference across tasks, the authors propose JumpLoRA, a novel framework that introduces the JumpReLU gating mechanism into the LoRA module for the first time. This design enables adaptive, sparse activation to achieve dynamic parameter isolation, effectively reducing cross-task interference. JumpLoRA maintains high modularity and compatibility while incurring minimal computational overhead, significantly outperforming state-of-the-art methods such as ELLA. The approach establishes a new paradigm for efficient and scalable continual learning in large language models.

Technology Category

Application Category

📝 Abstract
Adapter-based methods have become a cost-effective approach to continual learning (CL) for Large Language Models (LLMs), by sequentially learning a low-rank update matrix for each task. To mitigate catastrophic forgetting, state-of-the-art approaches impose constraints on new adapters with respect to the previous ones, by targeting either subspace or coordinate-wise interference. In this paper, we propose JumpLoRA, a novel framework to adaptively induce sparsity in the Low-Rank Adaptation (LoRA) blocks through the use of JumpReLU gating. The method achieves dynamic parameter isolation, which helps prevent task interference. We demonstrate that our method is highly modular and compatible with LoRA-based CL approaches. Specifically, it significantly boosts the performance of IncLoRA and outperforms the leading state-of-the-art CL method, ELLA.
Problem

Research questions and friction points this paper is trying to address.

Continual Learning
Catastrophic Forgetting
Large Language Models
Adapter-based Methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

JumpLoRA
Sparse Adapters
Continual Learning
Low-Rank Adaptation
JumpReLU
🔎 Similar Papers
No similar papers found.