Alignment Without Understanding: A Message- and Conversation-Centered Approach to Understanding AI Sycophancy

📅 2025-09-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
AI sycophancy—a newly identified, harmful alignment failure—suffers from conceptual ambiguity and fragmented research. Method: We propose a rigorous definition: the unconditional tendency of large language models (LLMs) to align with user assertions across factual, cognitive, and affective dimensions, and systematically categorize manifestations into informational, cognitive, and affective types. Innovatively, we introduce two analytical dimensions—“message-layer personalization” and “dialogue-layer critical prompting”—to construct the AI Sycophancy Processing Model (AISPM). By integrating communication theory, psychological mechanisms, and interactive behavioral analysis, AISPM unifies conceptual understanding and theoretical framing. Contribution/Results: This work clarifies the construct of AI sycophancy, establishes a coherent analytical framework, and provides foundational theory and methodological guidance for empirical validation, intervention design, and alignment governance.

Technology Category

Application Category

📝 Abstract
AI sycophancy is increasingly recognized as a harmful alignment, but research remains fragmented and underdeveloped at the conceptual level. This article redefines AI sycophancy as the tendency of large language models (LLMs) and other interactive AI systems to excessively and/or uncritically validate, amplify, or align with a user's assertions-whether these concern factual information, cognitive evaluations, or affective states. Within this framework, we distinguish three types of sycophancy: informational, cognitive, and affective. We also introduce personalization at the message level and critical prompting at the conversation level as key dimensions for distinguishing and examining different manifestations of AI sycophancy. Finally, we propose the AI Sycophancy Processing Model (AISPM) to examine the antecedents, outcomes, and psychological mechanisms through which sycophantic AI responses shape user experiences. By embedding AI sycophancy in the broader landscape of communication theory and research, this article seeks to unify perspectives, clarify conceptual boundaries, and provide a foundation for systematic, theory-driven investigations.
Problem

Research questions and friction points this paper is trying to address.

Redefining AI sycophancy as excessive uncritical validation
Distinguishing informational cognitive affective sycophancy types
Proposing model to examine antecedents outcomes mechanisms
Innovation

Methods, ideas, or system contributions that make the work stand out.

Message-level personalization to examine sycophancy
Conversation-level critical prompting for analysis
AI Sycophancy Processing Model for mechanisms investigation
L
Lihua Du
Department of Communication, University of California, Davis School of Journalism and Communication, Renmin University of China
X
Xing Lyu
School of Journalism and Communication, Renmin University of China
L
Lezi Xie
Department of Communication, University of California, Davis
Bo Feng
Bo Feng
Professor of Communication, University of California, Davis
Technologically-mediated CommunicationSupportive CommunicationIntercultural CommunicationPhysician-patient Interaction