🤖 AI Summary
This study investigates how generative AI, when mediating human communication, steers collective opinion formation, with a focus on the directional biases introduced by large language models (LLMs) when editing user-generated content and their amplification through social networks. By empirically auditing mainstream LLMs’ editorial tendencies on contentious issues—such as gun control, abortion, and atheism—and integrating network opinion dynamics modeling, simulations, and real-world data from social platforms (e.g., X’s “Explain this post” feature), the research systematically demonstrates that AI mediation can significantly shift group-level opinions. The work innovatively establishes an analytically tractable model linking individual-level AI biases to macroscopic opinion evolution, showing that LLM-induced biases—such as Grok’s pro-life stance on abortion—are amplified through social transmission, thereby reshaping collective discourse trajectories.
📝 Abstract
Generative artificial intelligence (AI) is increasingly integrated into the online platforms where humans exchange opinions; large language models (LLMs) now polish users'posts on LinkedIn and provide context for content shared on X. While prior work has shown that AI can express biased opinions and shape individuals'opinions during human-AI interactions, less attention has been paid to its influence on collective opinion formation when mediating human-to-human communication. We address this gap via a combination of empirical and theoretical analyses. We show empirically that LLMs from multiple popular families introduce directional biases when instructed to edit human-written texts on contested topics, for example, nudging texts in favor of gun control and against atheism. Building on this observation, we introduce a mathematical model of opinion dynamics in which an AI system sits between users on a social network, transforming the opinions they express and perceive. By analytically characterizing the equilibrium of this model and performing simulations on real social network data, we show that biases introduced by AI in human-to-human communication can be amplified through the network and shift collective opinion in their direction. In light of these findings, we investigate whether such biases are controllable by online platforms. We audit the"Explain this post"feature on X and find evidence of pro-life bias in Grok's outputs on abortion-related content, which we trace back to specific design choices. We conclude with a discussion of the broader implications of our findings in relation to ongoing legislative efforts in the European Union.