Emergence and Localisation of Semantic Role Circuits in LLMs

📅 2025-11-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
The internal representation of abstract semantic structures—particularly semantic roles—in large language models (LLMs) remains poorly understood. Method: We propose the “role-crossing minimal pair” localization method, integrating temporal emergence analysis, attribution tracing, and cross-architectural comparison across model scales to systematically identify and validate semantic role circuits. Contribution/Results: We discover a compact circuit comprising only 28 neurons that accounts for 89%–94% of semantic role attribution, exhibiting strong causal isolation and partial transferability across model sizes. Further analysis reveals the circuit operates via a progressive structural refinement mechanism, characterized by cross-scale conservation and high spectral similarity. This work provides the first neuron-level characterization of the dynamic formation pathway of semantic roles in LLMs, establishing a novel interpretability paradigm for semantic structure in foundation models.

Technology Category

Application Category

📝 Abstract
Despite displaying semantic competence, large language models' internal mechanisms that ground abstract semantic structure remain insufficiently characterised. We propose a method integrating role-cross minimal pairs, temporal emergence analysis, and cross-model comparison to study how LLMs implement semantic roles. Our analysis uncovers: (i) highly concentrated circuits (89-94% attribution within 28 nodes); (ii) gradual structural refinement rather than phase transitions, with larger models sometimes bypassing localised circuits; and (iii) moderate cross-scale conservation (24-59% component overlap) alongside high spectral similarity. These findings suggest that LLMs form compact, causally isolated mechanisms for abstract semantic structure, and these mechanisms exhibit partial transfer across scales and architectures.
Problem

Research questions and friction points this paper is trying to address.

Characterizing internal mechanisms grounding semantic structure in LLMs
Studying implementation of semantic roles through role-cross minimal pairs
Analyzing circuit concentration and cross-scale conservation in models
Innovation

Methods, ideas, or system contributions that make the work stand out.

Role-cross minimal pairs for semantic analysis
Temporal emergence analysis of circuit development
Cross-model comparison revealing structural conservation
🔎 Similar Papers
No similar papers found.
N
Nura Aljaafari
Department of Computer Science, University of Manchester, United Kingdom
Danilo S. Carvalho
Danilo S. Carvalho
University of Manchester
Artificial IntelligenceNatural Language Processing
A
André Freitas
Department of Computer Science, University of Manchester, United Kingdom; Idiap Research Institute, Switzerland; National Biomarker Centre, CRUK-MI, Univ. of Manchester, United Kingdom