🤖 AI Summary
This study investigates fine-grained substructure within coarse-grained semantic categories (e.g., “animal”, “tool”). To this end, we propose the Decoupled Continuous Semantic Representation Model (DCSRM), the first data-driven approach to decompose large language model word embeddings into multiple interpretable sub-embeddings, each encoding a distinct semantic sub-dimension. We identify semantic polarity—such as size, animacy, and valence—as a key latent variable driving this decomposition. Our method integrates embedding disentanglement, interpretability analysis, and voxel-wise fMRI-based neural encoding modeling, with validation against empirical neuroimaging data. Results demonstrate that the identified sub-dimensions exhibit statistically significant and spatially specific cortical representations, markedly enhancing both the granularity of semantic representation and its cognitive and neurobiological interpretability.
📝 Abstract
Understanding the core dimensions of conceptual semantics is fundamental to uncovering how meaning is organized in language and the brain. Existing approaches often rely on predefined semantic dimensions that offer only broad representations, overlooking finer conceptual distinctions. This paper proposes a novel framework to investigate the subdimensions underlying coarse-grained semantic dimensions. Specifically, we introduce a Disentangled Continuous Semantic Representation Model (DCSRM) that decomposes word embeddings from large language models into multiple sub-embeddings, each encoding specific semantic information. Using these sub-embeddings, we identify a set of interpretable semantic subdimensions. To assess their neural plausibility, we apply voxel-wise encoding models to map these subdimensions to brain activation. Our work offers more fine-grained interpretable semantic subdimensions of conceptual meaning. Further analyses reveal that semantic dimensions are structured according to distinct principles, with polarity emerging as a key factor driving their decomposition into subdimensions. The neural correlates of the identified subdimensions support their cognitive and neuroscientific plausibility.