Dutch Metaphor Extraction from Cancer Patients'Interviews and Forum Data using LLMs and Human in the Loop

📅 2025-11-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Metaphor identification in Dutch-language interviews and online forum texts from cancer patients remains challenging due to linguistic complexity and scarce annotated resources. Method: This study proposes a human-in-the-loop metaphor extraction framework leveraging large language models (LLMs), integrating chain-of-thought reasoning, few-shot learning, and self-prompting techniques to enhance metaphor detection performance in low-resource language settings; human-LLM collaborative verification ensures high annotation fidelity. Contribution/Results: We introduce HealthQuote.NL—the first publicly available, high-quality Dutch corpus of cancer-related metaphors—alongside reusable prompt templates and a standardized processing pipeline. Empirical evaluation demonstrates the effectiveness and robustness of LLMs for clinical metaphor identification across diverse discourse genres. The framework advances patient-centered communication, supports shared decision-making, strengthens health literacy, and provides a linguistic foundation for personalized care pathway design.

Technology Category

Application Category

📝 Abstract
Metaphors and metaphorical language (MLs) play an important role in healthcare communication between clinicians, patients, and patients'family members. In this work, we focus on Dutch language data from cancer patients. We extract metaphors used by patients using two data sources: (1) cancer patient storytelling interview data and (2) online forum data, including patients'posts, comments, and questions to professionals. We investigate how current state-of-the-art large language models (LLMs) perform on this task by exploring different prompting strategies such as chain of thought reasoning, few-shot learning, and self-prompting. With a human-in-the-loop setup, we verify the extracted metaphors and compile the outputs into a corpus named HealthQuote.NL. We believe the extracted metaphors can support better patient care, for example shared decision making, improved communication between patients and clinicians, and enhanced patient health literacy. They can also inform the design of personalized care pathways. We share prompts and related resources at https://github.com/aaronlifenghan/HealthQuote.NL
Problem

Research questions and friction points this paper is trying to address.

Extracting Dutch metaphors from cancer patient interviews and online forums
Evaluating LLM performance on metaphor detection using various prompting strategies
Creating verified metaphor corpus to improve healthcare communication and care
Innovation

Methods, ideas, or system contributions that make the work stand out.

Extracted Dutch metaphors using large language models
Applied chain of thought and few-shot prompting strategies
Verified metaphors through human-in-the-loop validation process
🔎 Similar Papers
No similar papers found.
Lifeng Han
Lifeng Han
Leiden University Medical Centre
Clinical NLPInformation ExtractionMachine TranslationMultiword Expressions
D
David Lindevelt
Leiden Institute of Advanced Computer Science (LIACS), Leiden University, NL
S
Sander Puts
Department of Radiation Oncology (MAASTRO), GROW Research Institute for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, NL
Erik van Mulligen
Erik van Mulligen
Department of Medical Informatics, Erasmus University Medical Center Rotterdam, NL
Suzan Verberne
Suzan Verberne
Leiden Institute of Advanced Computer Science, Leiden University
Information RetrievalNatural Language Processing