LitVISTA: A Benchmark for Narrative Orchestration in Literary Text

📅 2026-01-10
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses a systematic bias in current large language models, which overemphasize causal coherence at the expense of the intricate narrative arcs and structural orchestration characteristic of human storytelling when generating long-form narratives. To remedy this, the authors propose VISTA Space, a high-dimensional representation framework that unifies the modeling of narrative functions and structure, and introduce LitVISTA—the first structured evaluation benchmark tailored for literary narration. LitVISTA integrates high-dimensional narrative representations, fine-grained literary annotations, and an oracle-based assessment protocol leveraging mainstream large language models (e.g., GPT, Claude, Grok, Gemini) to systematically evaluate narrative structuring capabilities. Experiments reveal that existing models struggle to maintain globally coherent narrative perspectives and exhibit significant deficiencies in simultaneously capturing functional and structural narrative elements, with advanced reasoning strategies offering only marginal improvements.

Technology Category

Application Category

📝 Abstract
Computational narrative analysis aims to capture rhythm, tension, and emotional dynamics in literary texts. Existing large language models can generate long stories but overly focus on causal coherence, neglecting the complex story arcs and orchestration inherent in human narratives. This creates a structural misalignment between model- and human-generated narratives. We propose VISTA Space, a high-dimensional representational framework for narrative orchestration that unifies human and model narrative perspectives. We further introduce LitVISTA, a structurally annotated benchmark grounded in literary texts, enabling systematic evaluation of models'narrative orchestration capabilities. We conduct oracle evaluations on a diverse selection of frontier LLMs, including GPT, Claude, Grok, and Gemini. Results reveal systematic deficiencies: existing models fail to construct a unified global narrative view, struggling to jointly capture narrative function and structure. Furthermore, even advanced thinking modes yield only limited gains for such literary narrative understanding.
Problem

Research questions and friction points this paper is trying to address.

narrative orchestration
literary text
story arcs
structural alignment
large language models
Innovation

Methods, ideas, or system contributions that make the work stand out.

narrative orchestration
VISTA Space
LitVISTA benchmark
literary text analysis
large language models
🔎 Similar Papers
M
Mingzhe Lu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Y
Yiwen Wang
Northeastern University
Y
Yanbing Liu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Q
Qi You
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
C
Chong Liu
University of Science and Technology of China
R
Ruize Qin
University of Melbourne
Haoyu Dong
Haoyu Dong
Microsoft Research; UCAS
W
Wenyu Zhang
University of Science and Technology of China
J
Jiarui Zhang
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Y
Yue Hu
Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Yunpeng Li
Yunpeng Li
Institute of Information Engineering,Chinese Academy of Sciences
Large Language ModelsCyber Security