Hybrid Causal Identification and Causal Mechanism Clustering

📅 2025-07-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Real-world observational data often originate from heterogeneous multi-source environments, where causal relationships exhibit mechanism-level heterogeneity; however, mainstream additive noise models (ANMs) assume a single homogeneous causal mechanism and thus fail to capture such complexity. To address this, we propose the Mixture Conditional Variational Inference (MCVI) model—a novel, identifiable mixture ANM that jointly integrates Gaussian mixture models with neural networks. MCVI explicitly clusters distinct causal mechanisms while simultaneously identifying causal directions. It optimizes the evidence lower bound via a mixture conditional variational autoencoder, synergizing neural nonlinearity for flexible function approximation and Gaussian mixture structure for interpretable mechanism representation. Extensive experiments on synthetic and real-world benchmarks demonstrate that MCVI significantly outperforms state-of-the-art methods: it achieves accurate heterogeneous causal direction identification and uncovers the underlying distributional structure of latent causal mechanisms. MCVI establishes a new, interpretable, and scalable paradigm for heterogeneous causal discovery.

Technology Category

Application Category

📝 Abstract
Bivariate causal direction identification is a fundamental and vital problem in the causal inference field. Among binary causal methods, most methods based on additive noise only use one single causal mechanism to construct a causal model. In the real world, observations are always collected in different environments with heterogeneous causal relationships. Therefore, on observation data, this paper proposes a Mixture Conditional Variational Causal Inference model (MCVCI) to infer heterogeneous causality. Specifically, according to the identifiability of the Hybrid Additive Noise Model (HANM), MCVCI combines the superior fitting capabilities of the Gaussian mixture model and the neural network and elegantly uses the likelihoods obtained from the probabilistic bounds of the mixture conditional variational auto-encoder as causal decision criteria. Moreover, we model the casual heterogeneity into cluster numbers and propose the Mixture Conditional Variational Causal Clustering (MCVCC) method, which can reveal causal mechanism expression. Compared with state-of-the-art methods, the comprehensive best performance demonstrates the effectiveness of the methods proposed in this paper on several simulated and real data.
Problem

Research questions and friction points this paper is trying to address.

Identify heterogeneous causal relationships in observed data
Cluster causal mechanisms to reveal expression patterns
Improve accuracy in bivariate causal direction identification
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid Additive Noise Model identifiability
Mixture Conditional Variational Auto-encoder likelihoods
Causal mechanism clustering into numbers
S
Saixiong Liu
Key Institute of Big Data Science and Industry, Shanxi University, Taiyuan, Shanxi 030006, China
Yuhua Qian
Yuhua Qian
山西大学大数据科学与产业研究院
机器学习、数据挖掘、复杂网络
J
Jue Li
Key Institute of Big Data Science and Industry, Shanxi University, Taiyuan, Shanxi 030006, China
H
Honghong Cheng
School of Information, Shanxi University of Finance and Economics, Taiyuan, Shanxi 030012, China, and also with Institute of Big Data Science and Industry, Shanxi University, Taiyuan, Shanxi 030006, China
F
Feijiang Li
Key Institute of Big Data Science and Industry, Shanxi University, Taiyuan, Shanxi 030006, China