Understanding Cultural Alignment in Multilingual LLMs via Natural Debate Statements

📅 2026-02-13
📈 Citations: 0
Influential: 0
📄 PDF

Technology Category

Application Category

📝 Abstract
In this work we investigate the sociocultural values learned by large language models (LLMs). We introduce a novel open-access dataset, Sociocultural Statements, constructed from natural debate statements using a multi-step methodology. The dataset is synthetically labeled to enable the quantization of sociocultural norms and beliefs that LLMs exhibit in their responses to these statements, according to the Hofstede cultural dimensions. We verify the accuracy of synthetic labels using human quality control on a representative sample. We conduct a comparative analysis between two groups of LLMs developed in different countries (U.S. and China), and use as a comparative baseline patterns observed in human measurements. Using this new dataset and the analysis above, we found that culturally-distinct LLMs reflect the values and norms of the countries in which they were developed, highlighting their inability to adapt to the sociocultural backgrounds of their users.
Problem

Research questions and friction points this paper is trying to address.

cultural alignment
multilingual LLMs
sociocultural values
Hofstede dimensions
cross-cultural adaptation
Innovation

Methods, ideas, or system contributions that make the work stand out.

cultural alignment
multilingual LLMs
Sociocultural Statements dataset
Hofstede cultural dimensions
synthetic labeling
🔎 Similar Papers
No similar papers found.
V
Vlad-Andrei Negru
Department of Computer Science, Technical University of Cluj-Napoca, Cluj-Napoca, Romania
C
Camelia Lemnaru
Department of Computer Science, Technical University of Cluj-Napoca, Cluj-Napoca, Romania
Mihai Surdeanu
Mihai Surdeanu
Professor, Computer Science, University of Arizona
natural language processingapplied machine learningartificial intelligence
R
Rodica Potolea
Department of Computer Science, Technical University of Cluj-Napoca, Cluj-Napoca, Romania