Adversary-Free Counterfactual Prediction via Information-Regularized Representations

๐Ÿ“… 2025-10-17
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This paper addresses counterfactual prediction under distributional shift (i.e., assignment bias). We propose an information-theoretic representation learning framework that avoids adversarial training. Our method explicitly disentangles confounders by minimizing the mutual information between treatment variables and latent representations, thereby inducing causally invariant representations. We optimize a variational upper bound on this mutual information and jointly train a supervised decoder to formulate an end-to-end objective. Theoretically, we establish the first counterfactual prediction framework grounded in information-theoretic boundsโ€”ensuring training stability and interpretability. Practically, our approach naturally extends to dynamic decision-making settings. Experiments on synthetic benchmarks and real-world clinical datasets demonstrate substantial improvements over state-of-the-art balancing, reweighting, and adversarial methods: both counterfactual prediction error and policy evaluation bias are significantly reduced.

Technology Category

Application Category

๐Ÿ“ Abstract
We study counterfactual prediction under assignment bias and propose a mathematically grounded, information-theoretic approach that removes treatment-covariate dependence without adversarial training. Starting from a bound that links the counterfactual-factual risk gap to mutual information, we learn a stochastic representation Z that is predictive of outcomes while minimizing I(Z; T). We derive a tractable variational objective that upper-bounds the information term and couples it with a supervised decoder, yielding a stable, provably motivated training criterion. The framework extends naturally to dynamic settings by applying the information penalty to sequential representations at each decision time. We evaluate the method on controlled numerical simulations and a real-world clinical dataset, comparing against recent state-of-the-art balancing, reweighting, and adversarial baselines. Across metrics of likelihood, counterfactual error, and policy evaluation, our approach performs favorably while avoiding the training instabilities and tuning burden of adversarial schemes.
Problem

Research questions and friction points this paper is trying to address.

Addressing counterfactual prediction under assignment bias
Removing treatment-covariate dependence without adversarial training
Extending framework to dynamic settings with sequential representations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Information-regularized representations remove treatment-covariate dependence
Variational objective bounds information term with supervised decoder
Dynamic extension applies information penalty to sequential representations
๐Ÿ”Ž Similar Papers
No similar papers found.
Shiqin Tang
Shiqin Tang
Center for AI and Robotics, Chinese Academy of Sciences
Machine Learning
R
Rong Feng
Department of Computer Science, City University of Hong Kong, Hong Kong
S
Shuxin Zhuang
Department of Data Science, City University of Hong Kong, Hong Kong
H
Hongzong Li
Generative AI Research and Development Center, The Hong Kong University of Science and Technology, Hong Kong
Youzhi Zhang
Youzhi Zhang
CAIR, Hong Kong Institute of Science & Innovation, Chinese Academy of Sciences
Computational Game TheoryOptimizationMulti-agent SystemsArtificial Intelligence