Empirical evidence of Large Language Model's influence on human spoken communication

📅 2024-09-03

🏛️ arXiv.org

📈 Citations: 11

✨ Influential: 1

career value

198K/year

🤖 AI Summary

This study investigates whether large language models (LLMs), such as ChatGPT, reshape human spoken language and cultural practices through human–AI linguistic feedback loops. Method: Leveraging ASR-transcribed speech from 280,000 university-level YouTube lecture videos, we conduct time-series word frequency analysis and employ a pre–post ChatGPT release quasi-experimental design. Contribution/Results: We present the first empirical evidence that LLMs directly influence authentic human spoken behavior: post-release, ChatGPT-characteristic lexical items exhibit statistically significant increases in academic speech (p < 0.001), confirming systematic oral imitation by humans. Moving beyond prior written-language–focused work, this study reveals the mechanism of AI-generated language diffusion into spoken discourse. It further highlights critical sociocultural risks—including erosion of linguistic diversity, discursive manipulation, and asymmetric human–AI co-evolution—thereby advancing foundational understanding of LLMs’ real-world linguistic impact.

Technology Category

Application Category

📝 Abstract

Artificial Intelligence (AI) agents now interact with billions of humans in natural language, thanks to advances in Large Language Models (LLMs) like ChatGPT. This raises the question of whether AI has the potential to shape a fundamental aspect of human culture: the way we speak. Recent analyses revealed that scientific publications already exhibit evidence of AI-specific language. But this evidence is inconclusive, since scientists may simply be using AI to copy-edit their writing. To explore whether AI has influenced human spoken communication, we transcribed and analyzed about 280,000 English-language videos of presentations, talks, and speeches from more than 20,000 YouTube channels of academic institutions. We find a significant shift in the trend of word usage specific to words distinctively associated with ChatGPT following its release. These findings provide the first empirical evidence that humans increasingly imitate LLMs in their spoken language. Our results raise societal and policy-relevant concerns about the potential of AI to unintentionally reduce linguistic diversity, or to be deliberately misused for mass manipulation. They also highlight the need for further investigation into the feedback loops between machine behavior and human culture.

Problem

Research questions and friction points this paper is trying to address.

Investigates ChatGPT's impact on human spoken language patterns

Measures lexicographic shifts in academic and podcast discourse post-ChatGPT

Assesses risks of AI-driven cultural homogenization and manipulation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Econometric causal inference techniques applied

Detects lexicographic shift in human speech

Measures ChatGPT's cultural influence quantitatively

🔎 Similar Papers

The Adoption and Efficacy of Large Language Models: Evidence From Consumer Complaints in the Financial Industry