SocioEval: A Template-Based Framework for Evaluating Socioeconomic Status Bias in Foundation Models

📅 2026-04-02
📈 Citations: 0
Influential: 0
📄 PDF

career value

190K/year
🤖 AI Summary
This study addresses the lack of a systematic evaluation framework for socioeconomic status (SES) bias in large language models (LLMs) during high-stakes decision-making. The authors formally define and quantify this bias by constructing a template-based benchmark encompassing 8 themes, 18 topics, and 240 prompts. They analyze 3,120 model responses across 13 mainstream LLMs under six class-comparison configurations through a three-stage human annotation process. Results reveal a substantial variation in bias prevalence (0.42%–33.75%), with lifestyle-related judgments exhibiting bias levels ten times higher than those in educational decisions. Furthermore, while existing mitigation strategies partially alleviate explicit discrimination, they prove ineffective against implicit stereotypes. This work establishes a scalable, fine-grained benchmark for auditing class-based biases in LLMs.

Technology Category

Application Category

📝 Abstract
As Large Language Models (LLMs) increasingly power decision-making systems across critical domains, understanding and mitigating their biases becomes essential for responsible AI deployment. Although bias assessment frameworks have proliferated for attributes such as race and gender, socioeconomic status bias remains significantly underexplored despite its widespread implications in the real world. We introduce SocioEval, a template-based framework for systematically evaluating socioeconomic bias in foundation models through decision-making tasks. Our hierarchical framework encompasses 8 themes and 18 topics, generating 240 prompts across 6 class-pair combinations. We evaluated 13 frontier LLMs on 3,120 responses using a rigorous three-stage annotation protocol, revealing substantial variation in bias rates (0.42\%-33.75\%). Our findings demonstrate that bias manifests differently across themes lifestyle judgments show 10$\times$ higher bias than education-related decisions and that deployment safeguards effectively prevent explicit discrimination but show brittleness to domain-specific stereotypes. SocioEval provides a scalable, extensible foundation for auditing class-based bias in language models.
Problem

Research questions and friction points this paper is trying to address.

socioeconomic status bias
foundation models
bias evaluation
large language models
algorithmic fairness
Innovation

Methods, ideas, or system contributions that make the work stand out.

socioeconomic bias
template-based evaluation
foundation models
bias auditing
LLM fairness