Operationalizing Longitudinal Causal Discovery Under Real-World Workflow Constraints

📅 2026-02-27
📈 Citations: 0
Influential: 0
🤖 AI Summary
This study addresses a gap in longitudinal causal discovery: data from large-scale systems are recorded under real-world workflows whose implied partial orderings are rarely formalized, so the causal graph search space becomes larger than the recording process actually allows. The authors propose a workflow-induced constraint framework that encodes workflow-consistent partial orderings through a structural mask derived from the workflow, a time-aligned indexing scheme, and a block-wise measurement structure. Without requiring a new optimization algorithm, this reduces structural ambiguity and improves the interpretability of causal structures in mixed discrete-continuous panel data. The framework also supports interventional queries and bootstrap-based uncertainty quantification for lagged total effects. Applied to a Japanese annual health screening cohort of 107,261 individuals, the workflow-constrained longitudinal LiNGAM model yielded temporally consistent within-time substructures and lagged total effects with explicit uncertainty, and sensitivity analyses with alternative exposure and body-composition definitions preserved the main qualitative patterns.
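As a rough illustration of bootstrap-based uncertainty for a lagged effect, the sketch below resamples individuals with replacement and reports a percentile interval. It is not the authors' procedure: a plain least-squares slope of next year's outcome on this year's exposure stands in for the LiNGAM lagged total effect, and the data, function names (`lagged_slope`, `bootstrap_ci`), and parameters are all hypothetical.

```python
import numpy as np

def lagged_slope(x_prev, y_next):
    """Least-squares slope of y at time t+1 on x at time t (stand-in estimator)."""
    x = x_prev - x_prev.mean()
    return float(x @ (y_next - y_next.mean()) / (x @ x))

def bootstrap_ci(x_prev, y_next, n_boot=500, alpha=0.05, seed=0):
    """Percentile interval obtained by resampling individuals with replacement."""
    rng = np.random.default_rng(seed)
    n = len(x_prev)
    draws = np.empty(n_boot)
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)  # one resampled cohort
        draws[b] = lagged_slope(x_prev[idx], y_next[idx])
    lo, hi = np.quantile(draws, [alpha / 2, 1 - alpha / 2])
    return float(lo), float(hi)

# Synthetic one-lag data, assumed purely for illustration.
rng = np.random.default_rng(1)
x_prev = rng.normal(size=500)
y_next = 0.5 * x_prev + rng.normal(size=500)

point = lagged_slope(x_prev, y_next)
lo, hi = bootstrap_ci(x_prev, y_next)
print(f"lagged effect {point:.3f}, 95% interval [{lo:.3f}, {hi:.3f}]")
```

Resampling whole individuals, rather than person-years, keeps each person's repeated measurements together, which is the usual choice for panel data.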

📝 Abstract
Causal discovery has achieved substantial theoretical progress, yet its deployment in large-scale longitudinal systems remains limited. A key obstacle is that operational data are generated under institutional workflows whose induced partial orders are rarely formalized, enlarging the admissible graph space in ways inconsistent with the recording process. We characterize a workflow-induced constraint class for longitudinal causal discovery that restricts the admissible directed acyclic graph space through protocol-derived structural masks and timeline-aligned indexing. Rather than introducing a new optimization algorithm, we show that explicitly encoding workflow-consistent partial orders reduces structural ambiguity, especially in mixed discrete-continuous panels where within-time orientation is weakly identified. The framework combines workflow-derived admissible-edge constraints, measurement-aligned time indexing and block structure, bootstrap-based uncertainty quantification for lagged total effects, and a dynamic representation supporting intervention queries. In a nationwide annual health screening cohort in Japan with 107,261 individuals and 429,044 person-years, workflow-constrained longitudinal LiNGAM yields temporally consistent within-time substructures and interpretable lagged total effects with explicit uncertainty. Sensitivity analyses using alternative exposure and body-composition definitions preserve the main qualitative patterns. We argue that formalizing workflow-derived constraint classes improves structural interpretability without relying on domain-specific edge specification, providing a reproducible bridge between operational workflows and longitudinal causal discovery under standard identifiability assumptions.
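
The idea of a protocol-derived structural mask can be sketched in a few lines. The example below is a minimal illustration under stated assumptions, not the paper's implementation: the variable names, the three measurement blocks, and the function `workflow_mask` are hypothetical, and the rule used here (any earlier time point may affect a later one; within a time point, a variable may only be affected by variables in the same or an earlier workflow block) is one plausible reading of "workflow-consistent partial orders".

```python
import numpy as np

def workflow_mask(blocks, n_times):
    """Build an admissible-edge mask from workflow blocks.

    blocks: list of lists of variable names, ordered as the workflow records them.
    Returns (names, mask); mask[i, j] is True when the edge j -> i is admissible.
    Node index is t * p + k for variable k at time t.
    """
    names = [v for block in blocks for v in block]
    rank = {v: k for k, block in enumerate(blocks) for v in block}
    p = len(names)
    mask = np.zeros((p * n_times, p * n_times), dtype=bool)
    for t_child in range(n_times):
        for i, child in enumerate(names):
            for t_parent in range(t_child + 1):  # never from the future
                for j, parent in enumerate(names):
                    if t_parent < t_child:
                        ok = True  # lagged edges are all admissible
                    else:
                        # Same time point: respect block order, forbid self-loops.
                        ok = parent != child and rank[parent] <= rank[child]
                    mask[t_child * p + i, t_parent * p + j] = ok
    return names, mask

# Hypothetical screening workflow: demographics, then body measures, then labs.
blocks = [["age", "sex"], ["bmi", "waist"], ["glucose"]]
names, mask = workflow_mask(blocks, n_times=2)
print(mask.sum(), "admissible edges out of", mask.size)
```

Edges between variables in the same block at the same time point stay admissible in both directions, so their orientation is left to the discovery algorithm; the mask only removes edges that contradict the recording order.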

Problem

Research questions and friction points this paper is trying to address.

causal discovery
longitudinal data
workflow constraints
structural ambiguity
partial orders

Innovation

Methods, ideas, or system contributions that make the work stand out.

workflow constraints
longitudinal causal discovery
structural identifiability
partial order
LiNGAM
Tadahisa Okuda
Kyoto University Graduate School of Medicine, Kyoto, Japan
Shohei Shimizu
SANKEN, The University of Osaka, Osaka, Japan; Faculty of Data Science, Shiga University, Shiga, Japan; AIP, RIKEN, Tokyo, Japan
Thong Pham
Associate Professor, University of South Australia
Prefabrication, Blast and Impact Engineering, Protective Structures, FRP, Sustainable Materials
Tatsuyoshi Ikenoue
Faculty of Medicine, University of Miyazaki, Miyazaki, Japan
Shingo Fukuma
Kyoto University Graduate School of Medicine, Kyoto, Japan; Hiroshima University Graduate School of Biomedical and Health Sciences, Hiroshima, Japan