Attributing Emergence in Million-Agent Systems

📅 2026-05-11

📈 Citations: 0

✨ Influential: 0

career value

259K/year

🤖 AI Summary

This study addresses the challenge of attributing macroscopic emergent phenomena to individual agents in large-scale multi-agent systems, a task where existing axiomatic attribution methods fail to scale beyond modest agent counts. We adapt the Aumann–Shapley path integral attribution framework to large language model–driven multi-agent simulations, presenting the first scalable attribution method that simultaneously satisfies four fundamental attribution axioms while handling up to millions of agents. We introduce the Attribution Scale Bias Theorem, rigorously proving that attribution results derived from small-scale samples cannot be post-processed to recover full-scale attributions. Experiments on the Bluesky dataset with 1.67 million users reveal structural discrepancies between full-scale and subsampled attributions, empirically validating the theoretical necessity of full-scale attribution for nonlinear macroscopic metrics.

📝 Abstract

Large language models (LLMs) can simulate human-like reasoning and decision-making in individual agents. LLM-powered multi-agent systems (MAS) combine such agents to simulate population-scale social phenomena such as polarization, information cascades, and market panics. Such studies require attributing macro emergence to individual agents, but existing axiomatic methods scale combinatorially in $N$ and have been confined to $N \lesssim 10^3$, while the phenomena they explain occur at $N \geq 10^6$. We address this gap by adapting Aumann--Shapley path-integral attribution to LLM-powered MAS at million-agent scale; the resulting method satisfies all four axioms, runs four to five orders of magnitude faster than sampled Shapley on the same hardware. We use this method to test the scale gap empirically: across 14 days of public Bluesky data ($1{,}671{,}587$ active users), we compute the attribution at both full scale and the visibility-biased $N = 10^2$ convenience sample used by small-scale studies, and the two disagree structurally. At full scale the long tail and middle tier jointly carry the majority; the biased small panel attributes almost everything to a few high-follower accounts. We then prove that under any nonlinear macro indicator the disagreement cannot be reduced by post-hoc rescaling: an Attribution Scaling Bias theorem shows that no global rescaling factor can reconcile small-scale and full-scale attribution. Full-scale attribution is therefore not a methodological choice but a theoretical requirement for any nonlinear macro indicator.

Problem

Research questions and friction points this paper is trying to address.

multi-agent systems

emergence attribution

large language models

scalability

macro-level phenomena

Innovation

Methods, ideas, or system contributions that make the work stand out.

Aumann-Shapley attribution

million-agent systems

emergence attribution