Living the Novel: A System for Generating Self-Training Timeline-Aware Conversational Agents from Novels

๐Ÿ“… 2025-12-08
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This paper addresses two key challenges in literary immersive dialogue with large language models: persona drift and narrative logic breakdown (e.g., spoilers, frame violations). To tackle these, we propose a two-stage training framework that innovatively integrates data-free reinforcement learning fine-tuning (RLFT), retrieval-augmented training, and a story timeline-aware knowledge graphโ€”enabling deep persona alignment and explicit narrative constraint modeling. Our method incorporates persona self-training and interference-resilient mechanisms, significantly improving stability and consistency in multi-character mobile dialogues. Empirical evaluation on *Twenty Thousand Leagues Under the Sea* demonstrates superior persona consistency over GPT-4o and achieves >98% narrative coherence and robustness. User diary analysis further confirms long-term interaction reliability. The core contribution is the first integration of temporal-aware knowledge graphs with unsupervised persona alignment, delivering a scalable solution for narrative consistency in literary AI agents.

Technology Category

Application Category

๐Ÿ“ Abstract
We present the Living Novel, an end-to-end system that transforms any literary work into an immersive, multi-character conversational experience. This system is designed to solve two fundamental challenges for LLM-driven characters. Firstly, generic LLMs suffer from persona drift, often failing to stay in character. Secondly, agents often exhibit abilities that extend beyond the constraints of the story's world and logic, leading to both narrative incoherence (spoiler leakage) and robustness failures (frame-breaking). To address these challenges, we introduce a novel two-stage training pipeline. Our Deep Persona Alignment (DPA) stage uses data-free reinforcement finetuning to instill deep character fidelity. Our Coherence and Robustness Enhancing (CRE) stage then employs a story-time-aware knowledge graph and a second retrieval-grounded training pass to architecturally enforce these narrative constraints. We validate our system through a multi-phase evaluation using Jules Verne's Twenty Thousand Leagues Under the Sea. A lab study with a detailed ablation of system components is followed by a 5-day in-the-wild diary study. Our DPA pipeline helps our specialized model outperform GPT-4o on persona-specific metrics, and our CRE stage achieves near-perfect performance in coherence and robustness measures. Our study surfaces practical design guidelines for AI-driven narrative systems: we find that character-first self-training is foundational for believability, while explicit story-time constraints are crucial for sustaining coherent, interruption-resilient mobile-web experiences.
Problem

Research questions and friction points this paper is trying to address.

Addresses persona drift in LLM-driven characters for narrative fidelity.
Prevents spoiler leakage and frame-breaking to maintain story coherence.
Enforces narrative constraints using timeline-aware knowledge graphs.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Two-stage training pipeline for character fidelity and narrative coherence
Deep Persona Alignment using data-free reinforcement finetuning
Coherence and Robustness Enhancing with story-time-aware knowledge graph
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yifei Huang
Shanda AI Research, Tokyo
T
Tianyu Yan
Dalian University of Technology
S
Sitong Gong
Dalian University of Technology
X
Xiwei Gao
Shanda AI Research, Tokyo
Caixin Kang
Caixin Kang
The University of Tokyo
Computer VisionTrustworthy AIAutonomous DrivingGenerative Models
Ruicong Liu
Ruicong Liu
The University of Tokyo
computer vision
H
Huchuan Lu
Dalian University of Technology
B
Bo Zheng
Shanda AI Research, Tokyo