When Shared Knowledge Hurts: Spectral Over-Accumulation in Model Merging

📅 2026-02-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work identifies and quantifies a previously overlooked issue in model merging—termed “spectral over-accumulation”—where knowledge sharing across tasks leads to redundant accumulation along dominant spectral directions, causing singular value inflation and biasing the merged model toward a shared subspace. To address this, the authors propose a training- and data-free post-processing method that leverages singular value decomposition to analyze the weight spectrum and calibrates singular values via a subspace overlap-aware rescaling strategy, thereby restoring spectral balance. Evaluated on vision and language benchmarks, the approach significantly enhances existing merging techniques, improving the accuracy of Task Arithmetic by 13.0% and achieving state-of-the-art performance.

Technology Category

Application Category

📝 Abstract
Model merging combines multiple fine-tuned models into a single model by adding their weight updates, providing a lightweight alternative to retraining. Existing methods primarily target resolving conflicts between task updates, leaving the failure mode of over-counting shared knowledge unaddressed. We show that when tasks share aligned spectral directions (i.e., overlapping singular vectors), a simple linear combination repeatedly accumulates these directions, inflating the singular values and biasing the merged model toward shared subspaces. To mitigate this issue, we propose Singular Value Calibration (SVC), a training-free and data-free post-processing method that quantifies subspace overlap and rescales inflated singular values to restore a balanced spectrum. Across vision and language benchmarks, SVC consistently improves strong merging baselines and achieves state-of-the-art performance. Furthermore, by modifying only the singular values, SVC improves the performance of Task Arithmetic by 13.0%. Code is available at: https://github.com/lyymuwu/SVC.
Problem

Research questions and friction points this paper is trying to address.

model merging
spectral over-accumulation
shared knowledge
singular values
subspace overlap
Innovation

Methods, ideas, or system contributions that make the work stand out.

model merging
spectral over-accumulation
singular value calibration
subspace overlap
task arithmetic
🔎 Similar Papers
No similar papers found.
Yayuan Li
Yayuan Li
University of Michigan
AR-AI Instructional AgentInstructional VideosVideo GenerationVision and Language
Z
Ze Peng
State Key Laboratory for Novel Software Technology and the National Institute of Healthcare Data Science, Nanjing University, Nanjing, Jiangsu 210093, China
J
Jian Zhang
State Key Laboratory for Novel Software Technology and the National Institute of Healthcare Data Science, Nanjing University, Nanjing, Jiangsu 210093, China
J
Jintao Guo
State Key Laboratory for Novel Software Technology and the National Institute of Healthcare Data Science, Nanjing University, Nanjing, Jiangsu 210093, China
Yue Duan
Yue Duan
Nanjing University
Semi-supervised LearningMultimodal LearningLarge Multimodal Models
Y
Yinghuan Shi
State Key Laboratory for Novel Software Technology and the National Institute of Healthcare Data Science, Nanjing University, Nanjing, Jiangsu 210093, China