Emergent LLM behaviors are observationally equivalent to data leakage

📅 2025-05-26

📈 Citations: 0

✨ Influential: 0

career value

182K/year

🤖 AI Summary

This paper interrogates whether purported “emergent” linguistic conventions in large language models (LLMs) during naming games genuinely reflect social competence or merely stem from memorization of pretraining data. To address this, the authors employ a multifaceted methodology: prompt engineering, distributional shift detection, context provenance tracing, representation probing, and quantitative measurement of training-data overlap. Results demonstrate that observed linguistic coordination strongly replicates high-frequency patterns in the training corpus (p < 0.001), and ablation experiments confirm that removing these patterns eliminates conventional behavior. The work introduces the novel “observational equivalence” framework, revealing that LLMs’ superficially social behavior arises from statistical memorization rather than genuine normative reasoning. This challenges prevailing interpretations of “emergent norms” and fundamentally questions the validity of using LLMs as cognitive or social agents in computational social science.

Technology Category

Application Category

📝 Abstract

Ashery et al. recently argue that large language models (LLMs), when paired to play a classic"naming game,"spontaneously develop linguistic conventions reminiscent of human social norms. Here, we show that their results are better explained by data leakage: the models simply reproduce conventions they already encountered during pre-training. Despite the authors' mitigation measures, we provide multiple analyses demonstrating that the LLMs recognize the structure of the coordination game and recall its outcomes, rather than exhibit"emergent"conventions. Consequently, the observed behaviors are indistinguishable from memorization of the training corpus. We conclude by pointing to potential alternative strategies and reflecting more generally on the place of LLMs for social science models.

Problem

Research questions and friction points this paper is trying to address.

Distinguishing emergent LLM behaviors from data leakage

Assessing LLM memorization versus genuine convention development

Evaluating LLMs' suitability for social science modeling

Innovation

Methods, ideas, or system contributions that make the work stand out.

Analyzing data leakage in LLM behaviors

Demonstrating memorization over emergent conventions

Proposing alternative social science strategies

🔎 Similar Papers

The Emerged Security and Privacy of LLM Agent: A Survey with Case Studies