ReasonAny: Incorporating Reasoning Capability to Any Model via Simple and Effective Model Merging

๐Ÿ“… 2026-01-09
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses a key challenge in integrating strong reasoning capabilities with domain-specific expertiseโ€”namely, how to effectively inject advanced reasoning without compromising specialized performance. The authors propose ReasonAny, a framework that reveals reasoning abilities are predominantly encoded in parameter regions with low gradient sensitivity. Leveraging this insight, they develop a training-free model merging strategy that precisely identifies and preserves critical parameters for both reasoning and domain tasks through gradient-based contrastive analysis. This approach circumvents the common pitfall of conventional fusion methods, which often degrade both reasoning depth and domain performance. Evaluated across diverse domains including safety, biomedicine, and finance, ReasonAny consistently outperforms existing techniques, simultaneously enhancing reasoning capability and domain-specific task accuracy.

Technology Category

Application Category

๐Ÿ“ Abstract
Large Reasoning Models (LRMs) with long chain-of-thought reasoning have recently achieved remarkable success. Yet, equipping domain-specialized models with such reasoning capabilities, referred to as"Reasoning + X", remains a significant challenge. While model merging offers a promising training-free solution, existing methods often suffer from a destructive performance collapse: existing methods tend to both weaken reasoning depth and compromise domain-specific utility. Interestingly, we identify a counter-intuitive phenomenon underlying this failure: reasoning ability predominantly resides in parameter regions with low gradient sensitivity, contrary to the common assumption that domain capabilities correspond to high-magnitude parameters. Motivated by this insight, we propose ReasonAny, a novel merging framework that resolves the reasoning-domain performance collapse through Contrastive Gradient Identification. Experiments across safety, biomedicine, and finance domains show that ReasonAny effectively synthesizes"Reasoning + X"capabilities, significantly outperforming state-of-the-art baselines while retaining robust reasoning performance.
Problem

Research questions and friction points this paper is trying to address.

Reasoning
Model Merging
Domain-Specialized Models
Performance Collapse
Large Reasoning Models
Innovation

Methods, ideas, or system contributions that make the work stand out.

model merging
reasoning capability
contrastive gradient identification
parameter sensitivity
reasoning + X
๐Ÿ”Ž Similar Papers
No similar papers found.
J
Junyao Yang
Shanghai Artificial Intelligence Laboratory, National University of Singapore
Chen Qian
Chen Qian
Renmin University of China
Large Language ModelsSafetyInterpretabilityGraph Neural Networks
D
Dongrui Liu
Shanghai Artificial Intelligence Laboratory
W
Wen Shen
Tongji University
Y
Yong Liu
Renmin University of China
Jing Shao
Jing Shao
Research Scientist, Shanghai AI Laboratory/Shanghai Jiao Tong University
Computer VisionMulti-Modal Large Language Model