Computational Representations of Character Significance in Novels

๐Ÿ“… 2026-01-21
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This study addresses the limitations of traditional character modeling in literary analysis, which overemphasizes appearance frequency while neglecting crucial narrative dimensions such as discourse about characters by others and distinctions between narrator and character voices. Drawing on Wolochโ€™s theory of โ€œthe one vs. the many,โ€ the authors propose a six-dimensional structural model for character representation. Integrating large language models with a task-specific Transformer architecture, this approach enables a multidimensional computational characterization of fictional characters. It moves beyond frequency-centric paradigms by offering the first quantitative operationalization of implicit dimensions like โ€œdiscourse by others.โ€ Empirical validation on a corpus of 19th-century British realist novels not only confirms key theoretical assumptions but also uncovers nuanced relationships between character centrality and gender dynamics.

Technology Category

Application Category

๐Ÿ“ Abstract
Characters in novels have typically been modeled based on their presence in scenes in narrative, considering aspects like their actions, named mentions, and dialogue. This conception of character places significant emphasis on the main character who is present in the most scenes. In this work, we instead adopt a framing developed from a new literary theory proposing a six-component structural model of character. This model enables a comprehensive approach to character that accounts for the narrator-character distinction and includes a component neglected by prior methods, discussion by other characters. We compare general-purpose LLMs with task-specific transformers for operationalizing this model of character on major 19th-century British realist novels. Our methods yield both component-level and graph representations of character discussion. We then demonstrate that these representations allow us to approach literary questions at scale from a new computational lens. Specifically, we explore Woloch's classic"the one vs the many"theory of character centrality and the gendered dynamics of character discussion.
Problem

Research questions and friction points this paper is trying to address.

character significance
narrative representation
literary theory
computational modeling
character discussion
Innovation

Methods, ideas, or system contributions that make the work stand out.

character representation
six-component structural model
computational literary analysis
character discussion graph
LLM vs task-specific transformer
๐Ÿ”Ž Similar Papers
H
Haaris Mian
Applied Mathematics, Columbia University
Melanie Subbiah
Melanie Subbiah
Phd Student, Columbia University
Natural Language Processing
S
Sharon Marcus
English & Comparative Literature, Columbia University
N
Nora Shaalan
English & Comparative Literature, Columbia University
Kathleen McKeown
Kathleen McKeown
Professor of Computer Science and Director, Data Science Institute, Columbia University
Artificial IntelligenceNatural Language ProcessingText Summarization