🤖 AI Summary
To address severe cross-task parameter interference, redundant trainable parameters, and challenges in continual learning within multi-task LoRA fine-tuning, this paper proposes OrthoLoRA. OrthoLoRA freezes the random projection matrix A and trains only matrix B equipped with task-specific sparse masks. Crucially, it is the first method to jointly enforce orthogonal subspace constraints and structured sparse masking, explicitly isolating task representation subspaces. This design substantially mitigates interference, enables efficient adapter merging, and supports incremental continual learning while effectively alleviating catastrophic forgetting. Evaluated across four diverse domains—natural language understanding, mathematical reasoning, code generation, and safety alignment—OrthoLoRA achieves superior performance using only 5% of the LoRA parameters (i.e., a 95% reduction in total tunable parameters) compared to standard LoRA, outperforming both full fine-tuning and state-of-the-art PEFT methods. These results demonstrate its dual advantages in parameter efficiency and generalization capability.
📝 Abstract
Low-Rank Adaptation (LoRA) has emerged as a popular parameter-efficient fine-tuning (PEFT) method for Large Language Models (LLMs), yet it still incurs notable overhead and suffers from parameter interference in multi-task scenarios. We propose LoRA with Reduced Interference (LoRI), a simple yet effective approach that freezes the projection matrices $A$ as random projections and sparsifies the matrices $B$ using task-specific masks. This design substantially reduces the number of trainable parameters while maintaining strong task performance. Moreover, LoRI minimizes cross-task interference in adapter merging by leveraging the orthogonality between adapter subspaces, and supports continual learning by using sparsity to mitigate catastrophic forgetting. Extensive experiments across natural language understanding, mathematical reasoning, code generation, and safety alignment tasks demonstrate that LoRI outperforms full fine-tuning and existing PEFT methods, while using up to 95% fewer trainable parameters than LoRA. In multi-task experiments, LoRI enables effective adapter merging and continual learning with reduced cross-task interference. Code is available at: https://github.com/juzhengz/LoRI