Facilitating Cognitive Accessibility with LLMs: A Multi-Task Approach to Easy-to-Read Text Generation

📅 2025-10-01
🤖 AI Summary
This study addresses the high manual cost and the scarcity of aligned data in producing Easy-to-Read (ETR) texts for people with cognitive disabilities. The authors propose automating ETR generation with multi-task learning (MTL) over three tasks: text summarization, text simplification, and ETR generation. Two strategies are compared on Mistral-7B and LLaMA-3-8B: multi-task Retrieval-Augmented Generation (RAG) for in-context learning, and MTL-LoRA, a parameter-efficient fine-tuning approach built on Low-Rank Adaptation (LoRA). Experiments on ETR-fr, a new high-quality French dataset, indicate that: (1) multi-task setups outperform single-task baselines across all configurations; (2) the RAG-based strategy enables generalization in out-of-domain settings; and (3) MTL-LoRA outperforms the other learning strategies in in-domain settings. Together, these results suggest that LLMs can reduce the manual effort of producing ETR content.

📝 Abstract
Simplifying complex texts is essential for ensuring equitable access to information, especially for individuals with cognitive impairments. The Easy-to-Read (ETR) initiative offers a framework for making content accessible to the neurodivergent population, but the manual creation of such texts remains time-consuming and resource-intensive. In this work, we investigate the potential of large language models (LLMs) to automate the generation of ETR content. To address the scarcity of aligned corpora and the specificity of ETR constraints, we propose a multi-task learning (MTL) approach that trains models jointly on text summarization, text simplification, and ETR generation. We explore two different strategies: multi-task retrieval-augmented generation (RAG) for in-context learning, and MTL-LoRA for parameter-efficient fine-tuning. Our experiments with Mistral-7B and LLaMA-3-8B, based on ETR-fr, a new high-quality dataset, demonstrate the benefits of multi-task setups over single-task baselines across all configurations. Moreover, results show that the RAG-based strategy enables generalization in out-of-domain settings, while MTL-LoRA outperforms all learning strategies within in-domain configurations.
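The multi-task RAG strategy described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the retriever, the prompt template, or the number of demonstrations, so everything below is an assumption: the pools `POOLS`, the helpers `retrieve` and `build_prompt`, the bag-of-words cosine similarity, and the placeholder English examples are all hypothetical. The sketch only shows the general idea of drawing in-context demonstrations from several tasks before asking for an ETR output.

```python
import math
import re
from collections import Counter

# Hypothetical demonstration pools, one per task. The texts are invented
# placeholders, not samples from ETR-fr or any other dataset.
POOLS = {
    "summarization": [
        ("The city council met on Monday to discuss the budget for schools and roads.",
         "The council discussed the school and road budget."),
        ("Heavy rain is expected across the region this weekend, with strong winds near the coast.",
         "Rain and wind are expected this weekend."),
    ],
    "simplification": [
        ("The council ratified the budget allocation for educational institutions.",
         "The council approved the money for schools."),
        ("Precipitation will intensify during the nocturnal hours.",
         "It will rain more at night."),
    ],
    "etr_generation": [
        ("The council approved a new budget for local schools.",
         "The council is a group of people.\nThey decided to give money to schools."),
        ("A storm will reach the coast on Saturday.",
         "A storm is coming.\nIt will arrive on Saturday."),
    ],
}

def bag_of_words(text):
    """Lowercased word counts; a stand-in for a real embedding model."""
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, pool, k=1):
    """Return the k (source, target) pairs whose source is most similar to the query."""
    q = bag_of_words(query)
    ranked = sorted(pool, key=lambda ex: cosine(q, bag_of_words(ex[0])), reverse=True)
    return ranked[:k]

def build_prompt(source, pools, k=1):
    """Assemble a few-shot prompt with k retrieved demonstrations per task."""
    parts = []
    for task, pool in pools.items():
        for src, tgt in retrieve(source, pool, k):
            parts.append(f"Task: {task}\nInput: {src}\nOutput: {tgt}\n")
    # The final block is the actual request, left open for the model to complete.
    parts.append(f"Task: etr_generation\nInput: {source}\nOutput:")
    return "\n".join(parts)

if __name__ == "__main__":
    print(build_prompt("The council voted on the school budget.", POOLS, k=1))
```

The resulting string would be passed to an LLM such as Mistral-7B or LLaMA-3-8B; in practice a dense retriever would likely replace the word-overlap similarity used here.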
Problem

Research questions and friction points this paper is trying to address.

Automating Easy-to-Read text generation for cognitive accessibility
Addressing data scarcity and specificity via multi-task learning
Enhancing generalization and performance with novel training strategies
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-task learning across text summarization, text simplification, and ETR generation
Retrieval-augmented generation for out-of-domain generalization
Parameter-efficient fine-tuning with MTL-LoRA approach
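For readers unfamiliar with LoRA, the standard low-rank update it relies on can be written as below. This is the generic LoRA formulation only; this page does not describe how MTL-LoRA shares or separates adapters across the three tasks, so no task-specific terms are shown. The symbols $W_0$, $A$, $B$, $r$, and $\alpha$ follow common LoRA notation and are not taken from the abstract.

```latex
% Standard LoRA: the pretrained weight W_0 is frozen and only the
% low-rank factors A and B are trained.
h = W_0 x + \frac{\alpha}{r} B A x,
\qquad
W_0 \in \mathbb{R}^{d \times k},\;
B \in \mathbb{R}^{d \times r},\;
A \in \mathbb{R}^{r \times k},\;
r \ll \min(d, k)
```

Because only $A$ and $B$ are updated, the number of trainable parameters per adapted matrix drops from $dk$ to $r(d + k)$, which is what makes the fine-tuning parameter-efficient.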
Authors
François Ledoyen
Université Caen Normandie, ENSICAEN, CNRS, Normandie Univ, GREYC UMR 6072, F-14000 Caen, France
Gaël Dias
Full Professor, Normandie Univ, UNICAEN, ENSICAEN, CNRS, GREYC
Natural Language Processing, Information Retrieval, Affective Computing
Jérémie Pantin
Université Caen Normandie, ENSICAEN, CNRS, Normandie Univ, GREYC UMR 6072, F-14000 Caen, France
Alexis Lechervy
Assistant Professor, GREYC, CNRS, Université de Caen
Computer Vision, Multimedia Indexing, Machine Learning
Fabrice Maurel
Université Caen Normandie, ENSICAEN, CNRS, Normandie Univ, GREYC UMR 6072, F-14000 Caen, France
Youssef Chahir
Université Caen Normandie, ENSICAEN, CNRS, Normandie Univ, GREYC UMR 6072, F-14000 Caen, France