How Relevance Emerges: Interpreting LoRA Fine-Tuning in Reranking LLMs

📅 2025-04-05

📈 Citations: 0

✨ Influential: 0

career value

216K/year

🤖 AI Summary

This study systematically investigates how LoRA fine-tuning enables large language models (Mistral-7B, LLaMA3.1-8B, Pythia-6.9B) to model and leverage relevance signals for paragraph re-ranking. To this end, we employ multi-rank (1/2/8/32) LoRA configurations, layer-wise behavior tracking, module-level ablation, and evaluation on MS MARCO. Our analysis uncovers the dynamic evolution of relevance modeling during adaptation. We identify— for the first time—the critical fine-tuned layers (mid-transformer blocks) and core subspaces (Q/K projections within multi-head attention) that dominantly govern re-ranking performance. Moreover, we reveal a nonlinear relationship between LoRA rank and module importance: low-rank adapters (e.g., rank 1 or 2) suffice to capture relevance effectively. These findings establish a new paradigm for interpretable, parameter-efficient adaptation in information retrieval. All models and analysis code are publicly released.

Technology Category

Application Category

📝 Abstract

We conduct a behavioral exploration of LoRA fine-tuned LLMs for Passage Reranking to understand how relevance signals are learned and deployed by Large Language Models. By fine-tuning Mistral-7B, LLaMA3.1-8B, and Pythia-6.9B on MS MARCO under diverse LoRA configurations, we investigate how relevance modeling evolves across checkpoints, the impact of LoRA rank (1, 2, 8, 32), and the relative importance of updated MHA vs. MLP components. Our ablations reveal which layers and projections within LoRA transformations are most critical for reranking accuracy. These findings offer fresh explanations into LoRA's adaptation mechanisms, setting the stage for deeper mechanistic studies in Information Retrieval. All models used in this study have been shared.

Problem

Research questions and friction points this paper is trying to address.

How relevance signals are learned in LoRA fine-tuned LLMs

Impact of LoRA rank and components on reranking accuracy

Critical layers and projections in LoRA transformations

Innovation

Methods, ideas, or system contributions that make the work stand out.

Behavioral exploration of LoRA fine-tuned LLMs

Investigates LoRA rank impact on relevance modeling

Ablations reveal critical LoRA layers for reranking

🔎 Similar Papers

Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers