Dialect Matters: Cross-Lingual ASR Transfer for Low-Resource Indic Language Varieties

๐Ÿ“… 2026-01-07
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This study addresses the challenges of cross-lingual automatic speech recognition (ASR) for low-resource Indo-Aryan dialects, which are characterized by spontaneity, ambient noise, and code-mixing. Through empirical evaluation, the work systematically assesses the transfer performance of modern ASR models across multiple Indian dialects, with a particular focus on the influence of phylogenetic distance and dialectal data quantity. Using Garhwali as a case study, the research demonstrates that phylogenetic proximity alone does not dictate transfer efficacy; remarkably, even a small amount of dialect-specific fine-tuning data can yield substantial performance gainsโ€”sometimes matching models trained extensively on high-resource standard languages. Furthermore, the study uncovers systematic transcription biases in current ASR systems against non-standardized speech varieties, highlighting a critical gap in robustness and inclusivity.

Technology Category

Application Category

๐Ÿ“ Abstract
We conduct an empirical study of cross-lingual transfer using spontaneous, noisy, and code-mixed speech across a wide range of Indic dialects and language varieties. Our results indicate that although ASR performance is generally improved with reduced phylogenetic distance between languages, this factor alone does not fully explain performance in dialectal settings. Often, fine-tuning on smaller amounts of dialectal data yields performance comparable to fine-tuning on larger amounts of phylogenetically-related, high-resource standardized languages. We also present a case study on Garhwali, a low-resource Pahari language variety, and evaluate multiple contemporary ASR models. Finally, we analyze transcription errors to examine bias toward pre-training languages, providing additional insight into challenges faced by ASR systems on dialectal and non-standardized speech.
Problem

Research questions and friction points this paper is trying to address.

dialect
low-resource
cross-lingual ASR
Indic languages
non-standardized speech
Innovation

Methods, ideas, or system contributions that make the work stand out.

cross-lingual transfer
low-resource ASR
dialectal speech
code-mixed speech
Garhwali
๐Ÿ”Ž Similar Papers
No similar papers found.
A
Akriti Dhasmana
Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, USA
A
Aarohi Srivastava
Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, USA
David Chiang
David Chiang
Associate Professor, University of Notre Dame
Natural Language ProcessingMachine Translation