🤖 AI Summary
This work addresses the challenges of learning directed acyclic graphs (DAGs) from observational data, particularly under heteroscedastic noise or distribution shifts, where combinatorial complexity and causal non-identifiability are exacerbated. The authors reformulate DAG learning as a continuous optimization problem over adjacency matrices and propose an adjoint-based DAG estimation method that jointly models sparse causal structures and exogenous noise levels, thereby adapting to heterogeneous noise conditions. By integrating noise adaptivity, sparsity, and non-negativity constraints with a smooth characterization of acyclicity and high-dimensional statistical techniques, the approach enhances model robustness while alleviating identifiability issues. Empirical results demonstrate that the proposed method significantly improves both the accuracy of causal structure recovery and algorithmic scalability in scenarios involving heteroscedasticity and distributional shifts.
📝 Abstract
Directed acyclic graphs (DAGs) constitute a central modeling tool to enable principled reasoning about cause-effect interactions in complex systems. However, since the causal structure underlying a group of variables is often unknown and interventions may be infeasible or ethically challenging to implement, there is a need to address the task of inferring DAGs from observational data. However, most classical structure identification approaches face two key obstacles: the combinatorial challenge of enforcing acyclicity, which severely limits scalability, and identifiability challenges arising from latent confounding or heterogeneous noise. This tutorial offers an overview of recent signal processing and optimization advances that address these issues by recasting DAG structure learning as a continuous, score-based estimation problem over adjacency matrices. We begin with a didactic introduction to structural equation models and the formulation of causal graph recovery, followed by a historical survey of score-based methods ranging from early combinatorial search schemes and greedy heuristics to modern continuous frameworks that leverage smooth characterizations of acyclicity. Building on this foundation, we describe concomitant DAG estimation methods that jointly infer sparse causal structure and exogenous noise levels, improving robustness under heteroscedasticity and distribution shifts by rendering the estimator noise adaptive. All in all, the tutorial introduces readers to challenges and opportunities for signal processing research at the crossroads of causal inference, high-dimensional statistics, and scalable graph learning, while outlining emerging directions including online, nonlinear, and neural causal discovery.