Cross-lingual robustness of LLM-brain alignment and its computational roots

📅 2026-05-20
📈 Citations: 0
Influential: 0
📄 PDF

career value

213K/year
🤖 AI Summary
This study investigates whether the alignment between large language models (LLMs) and the human brain during multilingual comprehension exhibits cross-linguistic robustness, extends to subcortical regions, and what computational mechanisms underlie this correspondence. Using whole-brain fMRI encoding models across naturalistic story-listening tasks in Chinese, English, and French, the authors analyze the capacity of Transformer-based models to predict neural responses. They report, for the first time, that LLMs consistently predict activity across a broad network of cortical and subcortical areas, with highly congruent alignment patterns across languages. This cross-linguistic neural alignment cannot be accounted for by prediction uncertainty or representational geometry, challenging prevailing views that emphasize contextual embedding superiority or surprisal-driven processing, and instead supports lexical–semantic correspondence as the core mechanism.
📝 Abstract
Large language models (LLMs) reliably predict neural activity during language comprehension and transformer depth has been interpreted as mirroring hierarchical cortical organization. However, it remains unclear whether such alignment extends to subcortical regions, overlaps spatially across languages, and what the computational roots of such alignment are. Here, we used a multilingual, whole-brain encoding framework to examine brain-LLM alignment across three typologically distinct languages: Mandarin, English, and French during naturalistic story listening. Our results show that across languages, transformer-based models predicted activity in a distributed landscape spanning widely distributed cortical functional networks like limbic, ventral attention, default mode network, and subcortical structures. Spatial alignment patterns showed substantial cross-linguistic overlap and remained largely stable across model layers, with limited layer progression consistent with functional cortical hierarchies. Contrary to previous evidence, contextual embeddings did not outperform static embeddings. To test candidate computational explanations, we examined whether layer-wise brain scores reflect surprisal and intrinsic dimensionality, and thereby predictive processing and information compression. Neither of these two computational metrics mirrored neural alignment profiles. Our findings suggest that brain-LLM alignment is spatially robust and cross-linguistically stable but not explainable from predictive uncertainty or representational geometry. Rather than directly reflecting shared hierarchical computation, neural predictivity may primarily arise from distributed lexical-semantic correspondences that generalize across languages.
Problem

Research questions and friction points this paper is trying to address.

cross-lingual robustness
brain-LLM alignment
subcortical regions
computational roots
neural predictivity
Innovation

Methods, ideas, or system contributions that make the work stand out.

cross-lingual alignment
LLM-brain correspondence
whole-brain encoding
lexical-semantic mapping
transformer hierarchy
🔎 Similar Papers
No similar papers found.
N
Ni Yang
Grammar and Cognition Lab, Department of Translation & Language Sciences, Universitat Pompeu Fabra, Barcelona, Spain
R
Rui He
Grammar and Cognition Lab, Department of Translation & Language Sciences, Universitat Pompeu Fabra, Barcelona, Spain
P
Philipp Homan
Department of Adult Psychiatry and Psychotherapy, University of Zurich, Zurich, Switzerland; Neuroscience Center Zurich, University of Zurich and ETH Zurich, Zurich, Switzerland
Iris Sommer
Iris Sommer
Professor of Psychiatry
D
Davide Staub
Scalable Scientific Machine Learning Lab, Imperial College London, Department of Earth Science and Engineering, London, United Kingdom
W
Wolfram Hinzen
Grammar and Cognition Lab, Department of Translation & Language Sciences, Universitat Pompeu Fabra, Barcelona, Spain; Institut Català de Recerca i Estudis Avançats (ICREA), Barcelona, Spain