Moral Reasoning Across Languages: The Critical Role of Low-Resource Languages in LLMs

📅 2025-04-28
📈 Citations: 0
Influential: 0
📄 PDF

career value

166K/year
🤖 AI Summary
This work evaluates the moral reasoning capabilities of large language models (LLMs) across multilingual settings—particularly for low-resource languages such as Vietnamese—across three levels of contextual complexity: sentence-, paragraph-, and document-level. To this end, we introduce MMRB, a typologically diverse, cross-lingual, multi-level moral reasoning benchmark covering five languages. Our study is the first to empirically demonstrate that low-resource languages exert a dominant influence on multilingual alignment, challenging the “high-resource centrism” assumption. Using LLaMA-3-8B, we conduct monolingual fine-tuning integrating moral alignment with controlled toxicity injection, enabling fine-grained cross-lingual analysis. Experiments reveal a pronounced decline in moral reasoning performance with increasing contextual complexity—most severe for low-resource languages. Notably, Vietnamese accuracy improves by 17.2% post-fine-tuning, underscoring the critical role of low-resource language modeling in enhancing global alignment robustness.

Technology Category

Application Category

📝 Abstract
In this paper, we introduce the Multilingual Moral Reasoning Benchmark (MMRB) to evaluate the moral reasoning abilities of large language models (LLMs) across five typologically diverse languages and three levels of contextual complexity: sentence, paragraph, and document. Our results show moral reasoning performance degrades with increasing context complexity, particularly for low-resource languages such as Vietnamese. We further fine-tune the open-source LLaMA-3-8B model using curated monolingual data for alignment and poisoning. Surprisingly, low-resource languages have a stronger impact on multilingual reasoning than high-resource ones, highlighting their critical role in multilingual NLP.
Problem

Research questions and friction points this paper is trying to address.

Evaluating LLM moral reasoning across diverse languages and complexities
Assessing performance degradation in low-resource languages like Vietnamese
Exploring low-resource languages' impact on multilingual reasoning in NLP
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multilingual Moral Reasoning Benchmark (MMRB) for evaluation
Fine-tuned LLaMA-3-8B with monolingual alignment data
Low-resource languages boost multilingual reasoning performance
🔎 Similar Papers
2024-07-02International Conference on Learning RepresentationsCitations: 7