🤖 AI Summary
This paper addresses the problem of verifying counterfactual fairness for probabilistic classifiers. We propose a formal method based on typed natural deduction, extending the TNDPQ calculus with causally annotated structural conditions that embed structural causal models and probabilistic reasoning into the type system. This enables a rigorous logical characterization of the counterfactual proposition: “Would the decision remain unchanged if a sensitive attribute were altered?” The resulting labeled proof system supports automated derivation and formally verifiable fairness certification, overcoming key limitations of traditional statistical fairness definitions—namely, their lack of causal semantics and formal provability. Empirical evaluation demonstrates that our framework effectively detects latent counterfactual unfairness in black-box classifiers, providing an interpretable and formally grounded mechanism for fairness assurance in trustworthy AI systems.
📝 Abstract
In this article we propose an extension of the typed natural deduction calculus TNDPQ to model the verification of counterfactual fairness in probabilistic classifiers. This is obtained by formulating specific structural conditions for causal labels and by checking that the evaluation is robust under their variation.