Counterfactual Fairness through Transforming Data Orthogonal to Bias

๐Ÿ“… 2024-03-26
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 2
โœจ Influential: 0
๐Ÿ“„ PDF

career value

194K/year
๐Ÿค– AI Summary
Machine learning models often exhibit counterfactual unfairness due to multiple continuous sensitive attributes (e.g., age, income). To address this, we propose Orthogonalization to Bias (OB), a preprocessing algorithm grounded in structural causal models (SCMs) and the joint normality assumption. We formally proveโ€”first in the literatureโ€”that linearly projecting features onto the subspace orthogonal to sensitive variables guarantees strict counterfactual fairness. OB constitutes a model-agnostic preprocessing framework compatible with both continuous and discrete sensitive attributes, enhanced by sparse regularization to improve numerical stability. Extensive experiments on synthetic and real-world datasets demonstrate that OB reduces the Counterfactual Fairness Distance (CFD) by 42% while preserving predictive accuracy across diverse models, thereby validating its generalizability and practical efficacy.

Technology Category

Application Category

๐Ÿ“ Abstract
Machine learning models have shown exceptional prowess in solving complex issues across various domains. However, these models can sometimes exhibit biased decision-making, resulting in unequal treatment of different groups. Despite substantial research on counterfactual fairness, methods to reduce the impact of multivariate and continuous sensitive variables on decision-making outcomes are still underdeveloped. We propose a novel data pre-processing algorithm, Orthogonal to Bias (OB), which is designed to eliminate the influence of a group of continuous sensitive variables, thus promoting counterfactual fairness in machine learning applications. Our approach, based on the assumption of a jointly normal distribution within a structural causal model (SCM), demonstrates that counterfactual fairness can be achieved by ensuring the data is orthogonal to the observed sensitive variables. The OB algorithm is model-agnostic, making it applicable to a wide range of machine learning models and tasks. Additionally, it includes a sparse variant to improve numerical stability through regularization. Empirical evaluations on both simulated and real-world datasets, encompassing settings with both discrete and continuous sensitive variables, show that our methodology effectively promotes fairer outcomes without compromising accuracy.
Problem

Research questions and friction points this paper is trying to address.

Reducing bias in machine learning decision-making outcomes
Handling multivariate and continuous sensitive variables
Achieving counterfactual fairness without accuracy loss
Innovation

Methods, ideas, or system contributions that make the work stand out.

OB algorithm ensures data orthogonal to bias
Model-agnostic approach for counterfactual fairness
Sparse variant enhances numerical stability
๐Ÿ”Ž Similar Papers