Cross-Cultural Value Awareness in Large Vision-Language Models

📅 2026-04-10
📈 Citations: 0
Influential: 0
📄 PDF

career value

197K/year
🤖 AI Summary
This study addresses the susceptibility of large vision-language models (LVLMs) to cultural cues—such as those signaling religion, nationality, or socioeconomic status—which can lead to biased moral and value judgments that lack sensitivity to cross-cultural differences. To systematically evaluate this issue, the authors propose a novel assessment framework that integrates Moral Foundations Theory with natural language lexical analysis to construct counterfactual image sets. Applying this framework to five state-of-the-art LVLMs, the work reveals a consistent overreliance on cultural symbols, resulting in outputs that significantly deviate from cross-cultural fairness. These findings underscore a critical gap in value alignment within current LVLMs, highlighting the urgent need for more culturally aware and equitable model design.

Technology Category

Application Category

📝 Abstract
The rapid adoption of large vision-language models (LVLMs) in recent years has been accompanied by growing fairness concerns due to their propensity to reinforce harmful societal stereotypes. While significant attention has been paid to such fairness concerns in the context of social biases, relatively little prior work has examined the presence of stereotypes in LVLMs related to cultural contexts such as religion, nationality, and socioeconomic status. In this work, we aim to narrow this gap by investigating how cultural contexts depicted in images influence the judgments LVLMs make about a person's moral, ethical, and political values. We conduct a multi-dimensional analysis of such value judgments in five popular LVLMs using counterfactual image sets, which depict the same person across different cultural contexts. Our evaluation framework diagnoses LVLM awareness of cultural value differences through the use of Moral Foundations Theory, lexical analyses, and the sensitivity of generated values to depicted cultural contexts.
Problem

Research questions and friction points this paper is trying to address.

cultural bias
vision-language models
stereotypes
Moral Foundations Theory
fairness
Innovation

Methods, ideas, or system contributions that make the work stand out.

cross-cultural fairness
large vision-language models
Moral Foundations Theory
counterfactual image sets
value judgment