$α$-LoRA: Effective Fine-Tuning via Base Model Rescaling

📅 2025-10-24
📈 Citations: 0
Influential: 0
🤖 AI Summary
To address the limited generalization of pretrained models when fine-tuned on few-shot, high-dimensional binary classification tasks, this paper proposes a fine-tuning framework based on weight matrix reparameterization. The method couples low-rank adaptation (LoRA) with a base-model rescaling mechanism and employs random matrix theory to model the generalization behavior of high-dimensional classifiers, thereby revealing how rescaling governs the spectral distribution of the weights and the resulting generalization bounds. Theoretically, the approach mitigates overfitting by controlling the effective rank and condition number of the classifier's weight matrix. Empirically, it consistently improves performance across multiple binary classification benchmarks and large language model (LLM) fine-tuning tasks, with particularly pronounced generalization gains under extreme data scarcity.

📝 Abstract
Fine-tuning has proven to be highly effective in adapting pre-trained models to perform better on new desired tasks with minimal data samples. Among the most widely used approaches are reparameterization methods, which update a target module by augmenting its frozen weight matrix with an additional trainable weight matrix. The most prominent example is Low-Rank Adaptation (LoRA), which has gained significant attention in recent years. In this paper, we introduce a new class of reparameterization methods for transfer learning, designed to enhance the generalization ability of fine-tuned models. We establish the effectiveness of our approach in a high-dimensional binary classification setting using tools from Random Matrix Theory, and further validate our theoretical findings through more realistic experiments, such as fine-tuning LLMs.
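The reparameterization described above can be sketched in code. The page does not give the exact $α$-LoRA formulation, so the snippet below is a minimal illustration under the assumption that the adapted weight takes the form $W = \alpha W_0 + BA$, where $\alpha$ rescales the frozen base weights and $B$, $A$ are low-rank trainable factors, as in standard LoRA; the names `adapted_forward`, `alpha`, `W0`, `A`, and `B` are illustrative, not from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, r = 8, 16, 2  # layer dimensions and low rank r << min(d_out, d_in)

W0 = rng.standard_normal((d_out, d_in))    # frozen pretrained weights
A = rng.standard_normal((r, d_in)) * 0.01  # trainable low-rank factor
B = np.zeros((d_out, r))                   # zero-initialized, as in standard LoRA
alpha = 0.9                                # assumed base-model rescaling coefficient

def adapted_forward(x):
    """Forward pass through the reparameterized layer: (alpha * W0 + B A) x."""
    W = alpha * W0 + B @ A
    return W @ x

x = rng.standard_normal(d_in)
y = adapted_forward(x)

# With B = 0, only the rescaled base model contributes to the output.
assert np.allclose(y, alpha * (W0 @ x))
```

In this reading, setting `alpha = 1` recovers vanilla LoRA, and the rescaling coefficient is the extra knob the paper analyzes for its effect on the spectrum of the adapted weight matrix.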
Problem

Research questions and friction points this paper is trying to address.

Improving generalization ability of fine-tuned models
Enhancing reparameterization methods for transfer learning
Validating effectiveness through theoretical and experimental approaches
Innovation

Methods, ideas, or system contributions that make the work stand out.

Base model rescaling enhances fine-tuning generalization
Reparameterization methods augment frozen weight matrices
Random Matrix Theory validates high-dimensional classification effectiveness