Answer Bubbles: Information Exposure in AI-Mediated Search

📅 2026-03-17

📈 Citations: 0

✨ Influential: 0

career value

198K/year

🤖 AI Summary

This study reveals significant biases in generative search systems regarding source selection, linguistic expression, and citation fidelity, which may give rise to “answer bubbles”—divergent information realities for identical queries across different systems. Analyzing 11,000 real-world queries processed by four system types, the authors conduct a large-scale empirical evaluation using natural language processing techniques, focusing on source diversity, summary language characteristics, and faithfulness to original sources. The research uncovers that generative systems disproportionately rely on Wikipedia and long-form texts while underrepresenting social media and negatively valenced sources. Although integrating retrieval mechanisms reduces speculative language by 60%, confident yet potentially unwarranted assertions persist, amplifying exposure bias. The paper introduces the concept of “answer bubbles,” offering a novel lens for understanding the informational ecosystem risks inherent in AI-generated content.

Technology Category

Application Category

📝 Abstract

Generative search systems are increasingly replacing link-based retrieval with AI-generated summaries, yet little is known about how these systems differ in sources, language, and fidelity to cited material. We examine responses to 11,000 real search queries across four systems -- vanilla GPT, Search GPT, Google AI Overviews, and traditional Google Search -- at three levels: source diversity, linguistic characterization of the generated summary, and source-summary fidelity. We find that generative search systems exhibit significant \textit{source-selection} biases in their citations, favoring certain sources over others. Incorporating search also selectively attenuates epistemic markers, reducing hedging by up to 60\% while preserving confidence language in the AI-generated summaries. At the same time, AI summaries further compound the citation biases: Wikipedia and longer sources are disproportionately overrepresented, whereas cited social media content and negatively framed sources are substantially underrepresented. Our findings highlight the potential for \textit{answer bubbles}, in which identical queries yield structurally different information realities across systems, with implications for user trust, source visibility, and the transparency of AI-mediated information access.

Problem

Research questions and friction points this paper is trying to address.

generative search

source bias

answer bubbles

information exposure

AI-mediated search

Innovation

Methods, ideas, or system contributions that make the work stand out.

answer bubbles

source-selection bias

generative search