Do BERT Embeddings Encode Narrative Dimensions? A Token-Level Probing Analysis of Time, Space, Causality, and Character in Fiction

📅 2026-04-12

📈 Citations: 0

✨ Influential: 0

career value

160K/year

🤖 AI Summary

This study investigates whether BERT embeddings implicitly encode multidimensional semantic structures—namely temporal, spatial, causal, and character-related information—in fictional narratives. The authors construct the first token-level narrative annotation dataset across these four dimensions with the aid of large language models and systematically evaluate BERT’s representational capacity using linear probing, class-balanced weighting, confusion matrices, and clustering analysis assessed via Adjusted Rand Index (ARI). Results demonstrate that BERT indeed captures significant narrative information, achieving 94% accuracy and a macro-averaged recall of 0.83 with linear probes. Causal (0.75) and spatial (0.66) dimensions exhibit particularly strong performance, significantly outperforming random baselines. The study also identifies a “boundary leakage” phenomenon, revealing that while narrative semantics are present, they do not form discrete clusters—providing the first token-level evidence of BERT’s implicit encoding of complex narrative structures.

Technology Category

Application Category

📝 Abstract

Narrative understanding requires multidimensional semantic structures. This study investigates whether BERT embeddings encode dimensions of fictional narrative semantics -- time, space, causality, and character. Using an LLM to accelerate annotation, we construct a token-level dataset labeled with these four narrative categories plus "others." A linear probe on BERT embeddings (94% accuracy) significantly outperforms a control probe on variance-matched random embeddings (47%), confirming that BERT encodes meaningful narrative information. With balanced class weighting, the probe achieves a macro-average recall of 0.83, with moderate success on rare categories such as causality (recall = 0.75) and space (recall = 0.66). However, confusion matrix analysis reveals "Boundary Leakage," where rare dimensions are systematically misclassified as "others." Clustering analysis shows that unsupervised clustering aligns near-randomly with predefined categories (ARI = 0.081), suggesting that narrative dimensions are encoded but not as discretely separable clusters. Future work includes a POS-only baseline to disentangle syntactic patterns from narrative encoding, expanded datasets, and layer-wise probing.

Problem

Research questions and friction points this paper is trying to address.

BERT embeddings

narrative semantics

time

space

causality

Innovation

Methods, ideas, or system contributions that make the work stand out.

BERT embeddings

narrative semantics

probing analysis