INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages

📅 2025-02-13
🤖 AI Summary
Existing multilingual dialogue benchmarks are biased toward high-resource languages and often rely on translations of English data, so they largely reflect Western-centric concepts and neglect low-resource African languages. Method: The paper introduces Injongo, a multicultural, open-source intent detection and slot-filling benchmark covering 16 African languages, with utterances written by native speakers across domains such as banking, travel, home, and dining. Contribution/Results: The authors benchmark fine-tuned multilingual Transformer models and prompted LLMs and find substantial gaps: GPT-4o averages only 26.0 F1 on slot filling and 70.6% accuracy on intent detection, behind the fine-tuning baselines, whereas on English both reach roughly 81% intent accuracy. Training on African-cultural utterances rather than Western-centric ones improves cross-lingual transfer from English.

📝 Abstract
Slot-filling and intent detection are well-established tasks in Conversational AI. However, current large-scale benchmarks for these tasks often exclude low-resource languages and rely on translations from English benchmarks, thereby predominantly reflecting Western-centric concepts. In this paper, we introduce Injongo -- a multicultural, open-source benchmark dataset for 16 African languages with utterances generated by native speakers across diverse domains, including banking, travel, home, and dining. Through extensive experiments, we benchmark fine-tuned multilingual transformer models and prompted large language models (LLMs), and show the advantage of leveraging African-cultural utterances over Western-centric utterances for improving cross-lingual transfer from English. Experimental results reveal that current LLMs struggle with the slot-filling task, with GPT-4o achieving an average F1-score of 26. In contrast, intent detection performance is notably better, with an average accuracy of 70.6%, though it still falls behind the fine-tuning baselines. On English, by comparison, GPT-4o and the fine-tuning baselines perform similarly on intent detection, achieving an accuracy of approximately 81%. Our findings suggest that LLM performance still lags for many low-resource African languages, and more work is needed to improve their downstream performance.
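The two metrics quoted in the abstract (intent accuracy and slot-filling F1) can be illustrated with a minimal sketch. It assumes slots are annotated with BIO tags and scored as exact-match spans with micro-averaged F1, which is a common convention but is not confirmed by the text above. The function names and the slot labels (`amount`, `recipient`) are hypothetical and chosen only for illustration.

```python
from typing import List, Optional, Set, Tuple

# (slot type, start index, end index exclusive)
Span = Tuple[str, int, int]


def extract_spans(tags: List[str]) -> Set[Span]:
    """Collect (type, start, end) spans from one BIO tag sequence.

    An I- tag that does not continue an open span of the same type is
    ignored (strict reading of BIO); this is an assumption of the sketch.
    """
    spans: Set[Span] = set()
    start: Optional[int] = None
    label: Optional[str] = None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing span
        continues = tag.startswith("I-") and label == tag[2:]
        if continues:
            continue
        if label is not None:
            spans.add((label, start, i))  # close the span that was open
        if tag.startswith("B-"):
            start, label = i, tag[2:]
        else:
            start, label = None, None
    return spans


def intent_accuracy(gold: List[str], pred: List[str]) -> float:
    """Fraction of utterances whose predicted intent equals the gold intent."""
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def slot_f1(gold_tags: List[List[str]], pred_tags: List[List[str]]) -> float:
    """Micro-averaged F1 over exact-match slot spans."""
    tp = n_gold = n_pred = 0
    for g, p in zip(gold_tags, pred_tags):
        gold_spans, pred_spans = extract_spans(g), extract_spans(p)
        tp += len(gold_spans & pred_spans)
        n_gold += len(gold_spans)
        n_pred += len(pred_spans)
    if tp == 0:
        return 0.0
    precision, recall = tp / n_pred, tp / n_gold
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    gold = [["O", "B-amount", "I-amount", "O", "B-recipient"]]
    pred = [["O", "B-amount", "I-amount", "O", "O"]]
    print("slot F1:", slot_f1(gold, pred))
    print("intent accuracy:", intent_accuracy(["transfer", "balance"], ["transfer", "alarm"]))
```

Under span-level scoring a slot counts only if both its type and its boundaries match, so a model can recognize most intents correctly and still score low on slot filling.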
Problem

Research questions and friction points this paper is trying to address.

Multicultural intent detection
Slot-filling for African languages
Cross-lingual transfer improvement
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multicultural African language dataset
Fine-tuning multilingual transformer models
Prompting large language models
Hao Yu
McGill University, Canada; Mila, Quebec AI Institute, Canada
Jesujoba Oluwadara Alabi
Saarland University
Natural Language Processing | Neural Machine Translation | Machine Learning | Information Extraction
Andiswa Bukula
Masakhane NLP; SADiLaR, South Africa
Zhuang Yun Jian
University of Toronto, Canada
En-Shiun Annie Lee
Ontario Tech University, and University of Toronto (Status-Only)
Natural Language Processing | Data Mining | Pattern Analysis
Tadesse Kebede Guge
Masakhane NLP
Israel Abebe Azime
Saarland University
NLP | Multimodal learning | Deep Learning Applications
Happy Buzaaba
Masakhane NLP; Princeton University, USA
Blessing K. Sibanda
Masakhane NLP
Godson Kalipe
Masakhane NLP
Jonathan Mukiibi
Masakhane NLP; Makerere University, Uganda
S. Kabenamualu
L3S Research Center, Germany
M. Setaka
SADiLaR, South Africa
Lolwethu Ndolela
Masakhane NLP
N. Odu
Masakhane NLP
Rooweither Mabuya
Masakhane NLP; SADiLaR, South Africa
Shamsuddeen Hassan Muhammad
Bayero University, Kano, & Google DeepMind Academic Fellow at Imperial College London
Natural Language Processing | Sentiment Analysis | AfricaNLP | Low-resource NLP | Multilinguality
Salomey Osei
University of Deusto
Machine Learning | NLP | Auto ML
Sokhar Samb
Dakar American University Of Science and Technology, Senegal
Juliet W. Murage
Masakhane NLP
Dietrich Klakow
Saarland University, Saarland Informatics Campus, PharmaScienceHub
Natural Language Processing | Speech Processing | Question Answering | Machine Learning
David Ifeoluwa Adelani
McGill University and Mila - Quebec AI Institute and Canada CIFAR AI Chair
Natural language processing | Multilinguality | Multilingual NLP | AfricaNLP | Low-resource NLP