A Cookbook for Community-driven Data Collection of Impaired Speech in Low-Resource Languages

📅 2025-07-03
🤖 AI Summary
Problem: Disordered-speech data for low-resource languages is scarce, hindering the equitable deployment of automatic speech recognition (ASR) for people with speech disabilities. Method: We propose a community-driven paradigm for data collection and model development, piloted on Akan, the most widely spoken language in Ghana, by creating the first open-source disordered speech corpus for Akan (Akan-Disordered Speech Corpus). We accompany it with a reusable data collection “recipe,” a lightweight speech annotation tool, and adaptation guidelines. Through open collaboration, local communities co-collect data and fine-tune open ASR models (Whisper, Wav2Vec 2.0) to better recognize speech affected by articulation disorders. Contribution/Results: This work establishes the first systematic, democratized pipeline for building disordered-speech ASR in low-resource languages, offering a transferable methodology and practical benchmark for disordered speech research in under-resourced linguistic contexts globally.

📝 Abstract
This study presents an approach for collecting speech samples to build Automatic Speech Recognition (ASR) models for impaired speech, particularly in low-resource languages. It aims to democratize ASR technology and data collection by developing a "cookbook" of best practices and training for community-driven data collection and ASR model building. As a proof of concept, this study curated the first open-source dataset of impaired speech in Akan, a widely spoken indigenous language in Ghana. The study involved participants from diverse backgrounds with speech impairments. The resulting dataset, along with the cookbook and open-source tools, is publicly available to enable researchers and practitioners to create inclusive ASR technologies tailored to the unique needs of speech-impaired individuals. In addition, this study presents the initial results of fine-tuning open-source ASR models to better recognize impaired speech in Akan.
Problem

Research questions and friction points this paper is trying to address.

Collecting impaired speech samples for low-resource languages
Democratizing ASR technology via community-driven data collection
Building inclusive ASR models for speech-impaired individuals
Innovation

Methods, ideas, or system contributions that make the work stand out.

Community-driven impaired speech data collection
Open-source dataset for low-resource languages
Fine-tuning ASR models for impaired speech
Sumaya Ahmed Salihs
Department of Computer Science, University of Ghana, Ghana
Isaac Wiafe
Department of Computer Science, University of Ghana, Ghana
Jamal-Deen Abdulai
Dr of Computer Science, University of Ghana
Computer Networking, Wireless Communication Systems, Sensor Networks, AI and Machine Learning
Elikem Doe Atsakpo
School of Computing and Engineering, University of West London, United Kingdom
Gifty Ayoka
Talking Tipps Africa Foundation, Ghana
Richard Cave
UCL
speech recognition for people with non-standard speech, MND/ALS
Akon Obu Ekpezu
Department of Information Processing Science, University of Oulu, Finland
Catherine Holloway
Graduate Student, Institute for Quantum Computing
quantum optics, quantum cryptography, bioinformatics
Katrin Tomanek
Google Research
Natural Language Processing, Active Learning, Automatic Speech Recognition, Machine Translation
Fiifi Baffoe Payin Winful
Department of Computer Science, University of Ghana, Ghana