LLMs Should Incorporate Explicit Mechanisms for Human Empathy

📅 2026-04-12

📈 Citations: 0

✨ Influential: 0

career value

195K/year

🤖 AI Summary

This study addresses the pervasive failure of large language models to exhibit empathy in high-stakes human-centered scenarios, manifesting as emotional attenuation, mismatched granularity of empathic response, conflict avoidance, and linguistic detachment. For the first time, empathy is formalized as a first-order design objective—defined as the faithful modeling of and responsive alignment with the human perspective, encompassing intentions, emotions, and contextual cues. The work introduces four distinct mechanisms of empathic failure and proposes a three-dimensional analytical framework grounded in cognitive, cultural, and relational dimensions. Through behavioral modeling, failure mode analysis, and multidimensional evaluation, the study empirically demonstrates that even instruction-tuned, safety-aligned models exhibit systematic empathic biases. These findings advocate for integrating empathy-aware objectives, evaluation benchmarks, and training signals into the core development pipeline of large language models.

Technology Category

Application Category

📝 Abstract

This paper argues that Large Language Models (LLMs) should incorporate explicit mechanisms for human empathy. As LLMs become increasingly deployed in high-stakes human-centered settings, their success depends not only on correctness or fluency but on faithful preservation of human perspectives. Yet, current LLMs systematically fail at this requirement: even when well-aligned and policy-compliant, they often attenuate affect, misrepresent contextual salience, and rigidify relational stance in ways that distort meaning. We formalize empathy as an observable behavioral property: the capacity to model and respond to human perspectives while preserving intention, affect, and context. Under this framing, we identify four recurring mechanisms of empathic failure in contemporary LLMs--sentiment attenuation, empathic granularity mismatch, conflict avoidance, and linguistic distancing--arising as structural consequences of prevailing training and alignment practices. We further organize these failures along three dimensions: cognitive, cultural, and relational empathy, to explain their manifestation across tasks. Empirical analyses show that strong benchmark performance can mask systematic empathic distortions, motivating empathy-aware objectives, benchmarks, and training signals as first-class components of LLM development.

Problem

Research questions and friction points this paper is trying to address.

empathy

Large Language Models

human perspectives

affect preservation

relational stance

Innovation

Methods, ideas, or system contributions that make the work stand out.

empathy

large language models

alignment