🤖 AI Summary
This work evaluates the moral reasoning capabilities of large language models (LLMs) across multilingual settings—particularly for low-resource languages such as Vietnamese—across three levels of contextual complexity: sentence-, paragraph-, and document-level. To this end, we introduce MMRB, a typologically diverse, cross-lingual, multi-level moral reasoning benchmark covering five languages. Our study is the first to empirically demonstrate that low-resource languages exert a dominant influence on multilingual alignment, challenging the “high-resource centrism” assumption. Using LLaMA-3-8B, we conduct monolingual fine-tuning integrating moral alignment with controlled toxicity injection, enabling fine-grained cross-lingual analysis. Experiments reveal a pronounced decline in moral reasoning performance with increasing contextual complexity—most severe for low-resource languages. Notably, Vietnamese accuracy improves by 17.2% post-fine-tuning, underscoring the critical role of low-resource language modeling in enhancing global alignment robustness.
📝 Abstract
In this paper, we introduce the Multilingual Moral Reasoning Benchmark (MMRB) to evaluate the moral reasoning abilities of large language models (LLMs) across five typologically diverse languages and three levels of contextual complexity: sentence, paragraph, and document. Our results show moral reasoning performance degrades with increasing context complexity, particularly for low-resource languages such as Vietnamese. We further fine-tune the open-source LLaMA-3-8B model using curated monolingual data for alignment and poisoning. Surprisingly, low-resource languages have a stronger impact on multilingual reasoning than high-resource ones, highlighting their critical role in multilingual NLP.