Causal Graph Learning via Distributional Invariance of Cause-Effect Relationship

2026-02-03
Trans. Mach. Learn. Res.
AI Summary
This work addresses efficient and accurate recovery of causal graph structure from observational data by proposing a causal discovery method based on distributional invariance. It relies on the observation that the conditional distribution of an effect given its causes stays invariant when the prior distribution of those causes shifts, and it tests candidate causal relationships by measuring how stable that conditional distribution is across multiple downsampled subsets of the data. Combined with the empirical sparsity of most causal graphs, this test yields an algorithm whose complexity is quadratic in the number of observed variables. The method reduces processing time by up to 25x relative to state-of-the-art methods on large-scale benchmark datasets while matching or exceeding their accuracy, which improves the scalability of causal discovery.

Abstract
This paper introduces a new framework for recovering causal graphs from observational data, leveraging the observation that the distribution of an effect, conditioned on its causes, remains invariant to changes in the prior distribution of those causes. This insight enables a direct test for potential causal relationships by checking the variance of their corresponding effect-cause conditional distributions across multiple downsampled subsets of the data. These subsets are selected to reflect different prior cause distributions, while preserving the effect-cause conditional relationships. Using this invariance test and exploiting an (empirical) sparsity of most causal graphs, we develop an algorithm that efficiently uncovers causal relationships with quadratic complexity in the number of observational variables, reducing the processing time by up to 25x compared to state-of-the-art methods. Our empirical experiments on a varied benchmark of large-scale datasets show superior or equivalent performance compared to existing works, while achieving enhanced scalability.
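The invariance idea in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: the abstract does not specify how the downsampled subsets are selected, so the sketch assumes the subsets are simply given, and it simulates them as blocks of binary data drawn under different cause priors with one fixed cause-to-effect mechanism. The function names (`conditional_table`, `invariance_score`, `make_toy_data`) and all numeric settings are hypothetical choices made for illustration.

```python
import numpy as np


def conditional_table(x, y, n_x=2, n_y=2):
    """Empirical P(y | x) as an (n_x, n_y) table for integer-coded variables."""
    counts = np.zeros((n_x, n_y))
    np.add.at(counts, (x, y), 1.0)  # accumulate joint counts
    counts += 1e-9                  # guard against empty rows
    return counts / counts.sum(axis=1, keepdims=True)


def invariance_score(x, y, subsets, n_x=2, n_y=2):
    """Mean variance of the estimated P(y | x) across subsets (lower = more stable)."""
    tables = np.stack(
        [conditional_table(x[idx], y[idx], n_x, n_y) for idx in subsets]
    )
    return float(tables.var(axis=0).mean())


def make_toy_data(priors=(0.2, 0.5, 0.8), n_per_subset=5000, seed=0):
    """Assumed setup: each subset has a different P(C=1) but the same P(E | C)."""
    rng = np.random.default_rng(seed)
    cause_parts, effect_parts, subsets = [], [], []
    start = 0
    for p in priors:
        c = (rng.random(n_per_subset) < p).astype(int)
        # Fixed mechanism: P(E=1 | C=0) = 0.1 and P(E=1 | C=1) = 0.9
        e = (rng.random(n_per_subset) < np.where(c == 1, 0.9, 0.1)).astype(int)
        cause_parts.append(c)
        effect_parts.append(e)
        subsets.append(np.arange(start, start + n_per_subset))
        start += n_per_subset
    return np.concatenate(cause_parts), np.concatenate(effect_parts), subsets


cause, effect, subsets = make_toy_data()
forward = invariance_score(cause, effect, subsets)   # stability of P(effect | cause)
backward = invariance_score(effect, cause, subsets)  # stability of P(cause | effect)
print(f"P(effect | cause) variance: {forward:.6f}")
print(f"P(cause | effect) variance: {backward:.6f}")
```

In this assumed setup the conditional in the causal direction varies only through sampling noise, while the reverse conditional moves with the cause prior, so the first score should come out clearly smaller than the second. How the paper constructs its subsets and turns such scores into a full graph is not covered by this sketch.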
Problem

Research questions and friction points this paper is trying to address.

causal graph learning
observational data
cause-effect relationship
distributional invariance
Innovation

Methods, ideas, or system contributions that make the work stand out.

causal graph learning
distributional invariance
conditional distribution
causal discovery
scalable algorithm
Authors

Nang Hung Nguyen
The University of Tokyo, Japan
Phi Le Nguyen
Institute for AI Innovation and Societal Impact (AI4LIFE), Vietnam; Hanoi University of Science and Technology, Vietnam
T. Truong
National Institute of Advanced Industrial Science and Technology (AIST), Japan
T. Hoang
School of Electrical Engineering and Computer Science, Voiland College of Engineering and Architecture, Washington State University, Pullman, Washington, US
Masashi Sugiyama
Director, RIKEN Center for Advanced Intelligence Project / Professor, The University of Tokyo
Machine Learning, Data Mining, Artificial Intelligence