Computational Lesions in Multilingual Language Models Separate Shared and Language-specific Brain Alignment

📅 2026-04-12
📈 Citations: 0
Influential: 0
📄 PDF

career value

207K/year
🤖 AI Summary
This study investigates whether the human brain employs shared and language-specific neural mechanisms in multilingual processing and establishes an interpretable alignment between multilingual large language models and brain activity. By applying targeted “computational lesions”—selectively zeroing out either shared or language-specific model parameters—and evaluating changes in the models’ ability to predict fMRI responses from 112 native speakers listening to English, Chinese, and French narratives, the research causally disentangles shared core components from language-specialized modules within six multilingual models. Lesioning the shared core reduced whole-brain encoding performance by 60.32%, whereas impairing language-specific components selectively degraded prediction accuracy for the corresponding native language while preserving cross-lingual representational separation. These findings reveal a “shared backbone plus language specialization” architecture underlying multilingual representation, offering a novel paradigm for cognitive modeling.

Technology Category

Application Category

📝 Abstract
How the brain supports language across different languages is a basic question in neuroscience and a useful test for multilingual artificial intelligence. Neuroimaging has identified language-responsive brain regions across languages, but it cannot by itself show whether the underlying processing is shared or language-specific. Here we use six multilingual large language models (LLMs) as controllable systems and create targeted ``computational lesions'' by zeroing small parameter sets that are important across languages or especially important for one language. We then compare intact and lesioned models in predicting functional magnetic resonance imaging (fMRI) responses during 100 minutes of naturalistic story listening in native English, Chinese and French (112 participants). Lesioning a compact shared core reduces whole-brain encoding correlation by 60.32% relative to intact models, whereas language-specific lesions preserve cross-language separation in embedding space but selectively weaken brain predictivity for the matched native language. These results support a shared backbone with embedded specializations and provide a causal framework for studying multilingual brain-model alignment.
Problem

Research questions and friction points this paper is trying to address.

multilingual
brain alignment
language-specific
shared processing
neuroscience
Innovation

Methods, ideas, or system contributions that make the work stand out.

computational lesions
multilingual language models
brain-model alignment
fMRI prediction
shared vs. language-specific processing
🔎 Similar Papers
No similar papers found.
Y
Yang Cui
Department of Computer Science, The University of Manchester, Manchester, UK.
Jingyuan Sun
Jingyuan Sun
Assistant Professor, The University of Manchester
neural encoding and decodingbrain machine interfacelarge language models
Yizheng Sun
Yizheng Sun
PhD Student, The University of Manchester
Vision-Language Multi-modalityComputation Efficiency
Y
Yifan Wang
Department of Computer Science, The University of Manchester, Manchester, UK.
Yunhao Zhang
Yunhao Zhang
Institute of Automation, Chinese Academy of Sciences
Natural Language ProcessingCognitive Science
J
Jixing Li
Department of Linguistics and Translation, City University of Hong Kong, Hong Kong, China.
Shaonan Wang
Shaonan Wang
The Hong Kong Polytechnic University
Natural Language Understanding of Machine and Mind
H
Hongpeng Zhou
Department of Computer Science, The University of Manchester, Manchester, UK.
John Hale
John Hale
Cognitive Science Department, Johns Hopkins University
cognitive science of language
C
Chengqing Zong
State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, CAS, Beijing, China.; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China.
Goran Nenadic
Goran Nenadic
Department of Computer Science, University of Manchester
Natural language processingtext mininghealth informatics