A Cookbook for Community-driven Data Collection of Impaired Speech in LowResource Languages

📅 2025-07-03

📈 Citations: 0

✨ Influential: 0

career value

165K/year

🤖 AI Summary

Speech data for disordered speech in low-resource languages is scarce, hindering the equitable deployment of automatic speech recognition (ASR) for persons with speech disabilities. Method: We propose a community-driven paradigm for data collection and model development, piloted on Akan—the most widely spoken language in Ghana—by creating the first open-source disordered speech corpus for Akan (Akan-Disordered Speech Corpus). We accompany it with a reusable data collection “recipe,” a lightweight speech annotation tool, and adaptation guidelines. Leveraging open collaboration, local communities co-collect data and fine-tune open ASR models (Whisper, Wav2Vec 2.0) for improved articulation disorder recognition. Contribution/Results: This work establishes the first systematic, democratized pipeline for building disordered-speech ASR in low-resource languages, offering a transferable methodology and practical benchmark for disordered speech research in under-resourced linguistic contexts globally.

Technology Category

Application Category

📝 Abstract

This study presents an approach for collecting speech samples to build Automatic Speech Recognition (ASR) models for impaired speech, particularly, low-resource languages. It aims to democratize ASR technology and data collection by developing a "cookbook" of best practices and training for community-driven data collection and ASR model building. As a proof-of-concept, this study curated the first open-source dataset of impaired speech in Akan: a widely spoken indigenous language in Ghana. The study involved participants from diverse backgrounds with speech impairments. The resulting dataset, along with the cookbook and open-source tools, are publicly available to enable researchers and practitioners to create inclusive ASR technologies tailored to the unique needs of speech impaired individuals. In addition, this study presents the initial results of fine-tuning open-source ASR models to better recognize impaired speech in Akan.

Problem

Research questions and friction points this paper is trying to address.

Collecting impaired speech samples for low-resource languages

Democratizing ASR technology via community-driven data collection

Building inclusive ASR models for speech-impaired individuals

Innovation

Methods, ideas, or system contributions that make the work stand out.

Community-driven impaired speech data collection

Open-source dataset for low-resource languages

Fine-tuning ASR models for impaired speech

🔎 Similar Papers

Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages

2024-09-13arXiv.orgCitations: 1

💼 Related Jobs

No related jobs found.

Authors to Follow