Statistical laws and linguistics inform meaning in naturalistic and fictional conversation

📅 2025-12-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study investigates whether Heaps’ law—governing lexical growth in discourse—is modulated by part-of-speech (POS) categories in natural dialogue (video-recorded conversations between strangers) versus fictional dialogue (scripted film/TV dialogue). Using cross-medium corpus collection, fine-grained POS tagging, and POS-specific Heaps’ law fitting, we conduct the first systematic comparison of vocabulary expansion rates for core word classes (e.g., nouns, verbs) across these discourse types. Results reveal that nouns expand significantly faster in fictional dialogue, whereas verbs exhibit greater growth advantage in natural dialogue—demonstrating that discourse type (interactive vs. narrative) exerts a deep, functional modulation on statistical linguistic regularities. This work bridges statistical linguistics with pragmatics and narratological linguistics, providing empirical grounding and a novel analytical framework for understanding how communicative function shapes macro-level language statistics.

Technology Category

Application Category

📝 Abstract
Conversation is a cornerstone of social connection and is linked to well-being outcomes. Conversations vary widely in type with some portion generating complex, dynamic stories. One approach to studying how conversations unfold in time is through statistical patterns such as Heaps' law, which holds that vocabulary size scales with document length. Little work on Heaps's law has looked at conversation and considered how language features impact scaling. We measure Heaps' law for conversations recorded in two distinct mediums: 1. Strangers brought together on video chat and 2. Fictional characters in movies. We find that scaling of vocabulary size differs by parts of speech. We discuss these findings through behavioral and linguistic frameworks.
Problem

Research questions and friction points this paper is trying to address.

Investigates vocabulary scaling in conversations via Heaps' law
Compares real stranger chats with fictional movie dialogues
Analyzes how parts of speech affect vocabulary growth patterns
Innovation

Methods, ideas, or system contributions that make the work stand out.

Heaps' law applied to video chat conversations
Vocabulary scaling analyzed by parts of speech
Comparison of fictional and real conversation dynamics
🔎 Similar Papers
No similar papers found.
A
Ashley M. A. Fehr
Computational Story Lab, Vermont Advanced Computing Center, University of Vermont, Burlington, VT 05405, US
C
Calla G. Beauregard
Computational Story Lab, Vermont Advanced Computing Center, University of Vermont, Burlington, VT 05405, US
J
Julia Witte Zimmerman
Computational Story Lab, Vermont Advanced Computing Center, University of Vermont, Burlington, VT 05405, US
K
Katie Ekström
Glass Brain Lab, Department of Biomedical Engineering; Vermont Conversation Lab, Larner College of Medicine, University of Vermont, Burlington, VT 05405, US
Pablo Rosillo-Rodes
Pablo Rosillo-Rodes
PhD candidate, IFISC
computational linguisticssociophysicscomplex systems
C
Christopher M. Danforth
Computational Story Lab, Vermont Advanced Computing Center, University of Vermont, Burlington, VT 05405, US
Peter Sheridan Dodds
Peter Sheridan Dodds
Professor/Director, Computational Story Lab, Vermont Complex Systems Institute, UVM
LanguageMeaningStoriesSociotechnical PhenomenaComplex Systems