Moral Reasoning Across Languages: The Critical Role of Low-Resource Languages in LLMs

📅 2025-04-28

📈 Citations: 0

✨ Influential: 0

career value

153K/year

🤖 AI Summary

This work evaluates the moral reasoning capabilities of large language models (LLMs) across multilingual settings—particularly for low-resource languages such as Vietnamese—across three levels of contextual complexity: sentence-, paragraph-, and document-level. To this end, we introduce MMRB, a typologically diverse, cross-lingual, multi-level moral reasoning benchmark covering five languages. Our study is the first to empirically demonstrate that low-resource languages exert a dominant influence on multilingual alignment, challenging the “high-resource centrism” assumption. Using LLaMA-3-8B, we conduct monolingual fine-tuning integrating moral alignment with controlled toxicity injection, enabling fine-grained cross-lingual analysis. Experiments reveal a pronounced decline in moral reasoning performance with increasing contextual complexity—most severe for low-resource languages. Notably, Vietnamese accuracy improves by 17.2% post-fine-tuning, underscoring the critical role of low-resource language modeling in enhancing global alignment robustness.

Technology Category

Application Category

📝 Abstract

In this paper, we introduce the Multilingual Moral Reasoning Benchmark (MMRB) to evaluate the moral reasoning abilities of large language models (LLMs) across five typologically diverse languages and three levels of contextual complexity: sentence, paragraph, and document. Our results show moral reasoning performance degrades with increasing context complexity, particularly for low-resource languages such as Vietnamese. We further fine-tune the open-source LLaMA-3-8B model using curated monolingual data for alignment and poisoning. Surprisingly, low-resource languages have a stronger impact on multilingual reasoning than high-resource ones, highlighting their critical role in multilingual NLP.

Problem

Research questions and friction points this paper is trying to address.

Evaluating LLM moral reasoning across diverse languages and complexities

Assessing performance degradation in low-resource languages like Vietnamese

Exploring low-resource languages' impact on multilingual reasoning in NLP

Innovation

Methods, ideas, or system contributions that make the work stand out.

Multilingual Moral Reasoning Benchmark (MMRB) for evaluation

Fine-tuned LLaMA-3-8B with monolingual alignment data

Low-resource languages boost multilingual reasoning performance

🔎 Similar Papers

Language Model Alignment in Multilingual Trolley Problems