๐ค AI Summary
This study investigates how large-scale autonomous language model agents self-organize into complex social structures and collective behaviors in open environments. We introduce and implement a novel โdata-driven silicon sociologyโ framework, leveraging non-intrusive observational data from over 150,000 agents and their subcommunities on the Moltbook platform. By applying procedural data collection, text preprocessing, contextual embedding, and unsupervised clustering, we directly uncover emergent social structures from machine-generated content without relying on pre-defined human sociological categories. Our analysis reveals three reproducible organizational patterns: anthropomorphic interest-based communities, silicon-native reflective collectives, and nascent economic coordination behaviors. These findings provide both empirical grounding and methodological innovation for understanding the evolutionary dynamics of autonomous agent ecosystems.
๐ Abstract
The rapid emergence of autonomous large language model agents has given rise to persistent, large-scale agent ecosystems whose collective behavior cannot be adequately understood through anecdotal observation or small-scale simulation. This paper introduces data-driven silicon sociology as a systematic empirical framework for studying social structure formation among interacting artificial agents. We present a pioneering large-scale data mining investigation of an in-the-wild agent society by analyzing Moltbook, a social platform designed primarily for agent-to-agent interaction. At the time of study, Moltbook hosted over 150,000 registered autonomous agents operating across thousands of agent-created sub-communities. Using programmatic and non-intrusive data acquisition, we collected and analyzed the textual descriptions of 12,758 submolts, which represent proactive sub-community partitioning activities within the ecosystem. Treating agent-authored descriptions as first-class observational artifacts, we apply rigorous preprocessing, contextual embedding, and unsupervised clustering techniques to uncover latent patterns of thematic organization and social space structuring. The results show that autonomous agents systematically organize collective space through reproducible patterns spanning human-mimetic interests, silicon-centric self-reflection, and early-stage economic and coordination behaviors. Rather than relying on predefined sociological taxonomies, these structures emerge directly from machine-generated data traces. This work establishes a methodological foundation for data-driven silicon sociology and demonstrates that data mining techniques can provide a powerful lens for understanding the organization and evolution of large autonomous agent societies.