Are You Listening to Me? Fine-Tuning Chatbots for Empathetic Dialogue

📅 2025-07-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Large language models (LLMs) often lack authentic empathic listening and emotionally grounded interaction in dialogue. Method: We propose a human-AI collaborative fine-tuning framework: (1) augmenting a small expert-curated empathic dialogue dataset using ChatGPT and Gemini to generate high-quality synthetic samples; and (2) introducing a dual-path evaluation protocol integrating structured sentiment analysis (VADER) with multi-dimensional expert annotation—specifically assessing emotional trajectory evolution, depth of empathic perception, and response coherence. Contribution/Results: Experiments reveal that structural sentiment alignment alone is insufficient for genuine empathy; qualitative depth significantly modulates user-perceived empathy. Systematic inter-model differences in empathic quality are observed. Our work validates the necessity of human-AI collaboration in building empathic dialogue agents and establishes a reproducible methodological paradigm for evaluating and optimizing empathic LLMs.

Technology Category

Application Category

📝 Abstract
Conversational agents have made significant progress since ELIZA, expanding their role across various domains, including healthcare, education, and customer service. As these agents become increasingly integrated into daily human interactions, the need for emotional intelligence, particularly empathetic listening, becomes increasingly essential. In this study, we explore how Large Language Models (LLMs) respond when tasked with generating emotionally rich interactions. Starting from a small dataset manually crafted by an expert to reflect empathic behavior, we extended the conversations using two LLMs: ChatGPT and Gemini. We analyzed the emotional progression of the dialogues using both sentiment analysis (via VADER) and expert assessments. While the generated conversations often mirrored the intended emotional structure, human evaluation revealed important differences in the perceived empathy and coherence of the responses. These findings suggest that emotion modeling in dialogues requires not only structural alignment in the expressed emotions but also qualitative depth, highlighting the importance of combining automated and humancentered methods in the development of emotionally competent agents.
Problem

Research questions and friction points this paper is trying to address.

Enhancing chatbots for empathetic dialogue in human interactions
Evaluating emotional intelligence in Large Language Models responses
Combining automated and human methods for emotional depth
Innovation

Methods, ideas, or system contributions that make the work stand out.

Fine-tuning LLMs for empathetic dialogue generation
Extending conversations using ChatGPT and Gemini
Combining sentiment analysis and expert assessments
🔎 Similar Papers
No similar papers found.
P
Paulo Ricardo Knob
Pontifícia Universidade Católica do Rio Grande do Sul
L
Leonardo Scholler
Pontifícia Universidade Católica do Rio Grande do Sul
J
Juliano Rigatti
Nelogica Sistemas de Software Ltda
Soraia Raupp Musse
Soraia Raupp Musse
Full Professor, Pontifical Catholic University of Rio Grande do Sul
Crowd SimulationPedestrian simulationComputer AnimationVirtual humansVisual perception