🤖 AI Summary
Prior work lacks quantitative evaluation of the topological plausibility and latent biases—particularly political homophily—in large language model (LLM)-generated social networks. Method: We propose a localized, role-by-role edge-generation prompting strategy (contrasted with global prompting), combined with zero-shot inference (using LLaMA and GPT series models) and rigorous social network metrics—including density, clustering coefficient, degree distribution, and homophily measures—to systematically assess structural realism and bias. Contribution/Results: Localized prompting significantly improves topological fidelity, bringing generated networks close to empirical benchmarks across multiple structural metrics. However, all models exhibit systematic overestimation of political homophily—by a factor of 2–5 relative to ground truth—exceeding biases observed along age, gender, or education dimensions. To our knowledge, this is the first empirical study to jointly examine structural realism and political bias in LLM-generated social networks.
📝 Abstract
Generating social networks is essential for many applications, such as epidemic modeling and social simulations. The emergence of generative AI, especially large language models (LLMs), offers new possibilities for social network generation: LLMs can generate networks without additional training or need to define network parameters, and users can flexibly define individuals in the network using natural language. However, this potential raises two critical questions: 1) are the social networks generated by LLMs realistic, and 2) what are risks of bias, given the importance of demographics in forming social ties? To answer these questions, we develop three prompting methods for network generation and compare the generated networks to a suite of real social networks. We find that more realistic networks are generated with"local"methods, where the LLM constructs relations for one persona at a time, compared to"global"methods that construct the entire network at once. We also find that the generated networks match real networks on many characteristics, including density, clustering, connectivity, and degree distribution. However, we find that LLMs emphasize political homophily over all other types of homophily and significantly overestimate political homophily compared to real social networks.