🤖 AI Summary
Addressing the longstanding trade-off between parameter efficiency and representational capacity in large language model (LLM) adaptation, this paper proposes Hadamard Low-Rank Adaptation (HLora), a fully decoupled, parameter-efficient fine-tuning method that operates independently of pretrained weights. HLora reconstructs incremental updates via the Hadamard product of two learnable low-rank matrices—introducing, for the first time, a weight-agnostic and inherently nonlinear low-rank update structure, thereby overcoming the expressivity limitations of linear low-rank methods such as LoRA. Theoretical analysis and matrix reconstruction experiments demonstrate HLora’s superior representational power. Empirically, HLora achieves state-of-the-art performance across diverse multitask benchmarks—including arithmetic and commonsense reasoning—outperforming leading parameter-efficient fine-tuning (PEFT) approaches like LoRA and HiRA. Moreover, it consistently delivers robust gains across multiple large-scale models (e.g., Llama, Qwen), confirming its generalizability and stability.
📝 Abstract
Large Language Models have demonstrated strong performance across a wide range of tasks, but adapting them efficiently to new domains remains a key challenge. Parameter-Efficient Fine-Tuning (PEFT) methods address this by introducing lightweight, trainable modules while keeping most pre-trained weights fixed. The prevailing approach, LoRA, models updates using a low-rank decomposition, but its expressivity is inherently constrained by the rank. Recent methods like HiRA aim to increase expressivity by incorporating a Hadamard product with the frozen weights, but still rely on the structure of the pre-trained model. We introduce ABBA, a new PEFT architecture that reparameterizes the update as a Hadamard product of two independently learnable low-rank matrices. In contrast to prior work, ABBA fully decouples the update from the pre-trained weights, enabling both components to be optimized freely. This leads to significantly higher expressivity under the same parameter budget. We formally analyze ABBA's expressive capacity and validate its advantages through matrix reconstruction experiments. Empirically, ABBA achieves state-of-the-art results on arithmetic and commonsense reasoning benchmarks, consistently outperforming existing PEFT methods by a significant margin across multiple models. Our code is publicly available at: https://github.com/CERT-Lab/abba.