ProCause: Generating Counterfactual Outcomes to Evaluate Prescriptive Process Monitoring Methods

📅 2025-08-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing prescriptive process monitoring (PresPM) evaluation methods suffer from the absence of counterfactual outcomes and fail to model temporal dependencies; approaches like RealCause rely solely on a single TARNet architecture and neglect sequential process dynamics. Method: This paper proposes a counterfactual generation framework integrating temporal modeling and multi-model ensembling. It employs LSTM to capture time-series dependencies in process data and combines S-Learner, T-Learner, TARNet, and ensemble learning strategies into a robust deep generative model. Contribution/Results: Experiments on both synthetic and real-world clinical process datasets demonstrate that the proposed method significantly improves counterfactual prediction accuracy and evaluation stability. It provides a more reliable, generalizable, and temporally aware benchmark for assessing PresPM techniques—addressing critical limitations in current causal inference–based process monitoring evaluation.

Technology Category

Application Category

📝 Abstract
Prescriptive Process Monitoring (PresPM) is the subfield of Process Mining that focuses on optimizing processes through real-time interventions based on event log data. Evaluating PresPM methods is challenging due to the lack of ground-truth outcomes for all intervention actions in datasets. A generative deep learning approach from the field of Causal Inference (CI), RealCause, has been commonly used to estimate the outcomes for proposed intervention actions to evaluate a new policy. However, RealCause overlooks the temporal dependencies in process data, and relies on a single CI model architecture, TARNet, limiting its effectiveness. To address both shortcomings, we introduce ProCause, a generative approach that supports both sequential (e.g., LSTMs) and non-sequential models while integrating multiple CI architectures (S-Learner, T-Learner, TARNet, and an ensemble). Our research using a simulator with known ground truths reveals that TARNet is not always the best choice; instead, an ensemble of models offers more consistent reliability, and leveraging LSTMs shows potential for improved evaluations when temporal dependencies are present. We further validate ProCause's practical effectiveness through a real-world data analysis, ensuring a more reliable evaluation of PresPM methods.
Problem

Research questions and friction points this paper is trying to address.

Evaluating PresPM methods lacks ground-truth outcomes for interventions
RealCause overlooks temporal dependencies in process event data
RealCause relies on limited single model architecture TARNet
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generative deep learning with sequential models
Integrates multiple causal inference architectures
Uses ensemble models for consistent reliability
🔎 Similar Papers
No similar papers found.
J
Jakob De Moor
Research Centre for Information Systems Engineering (LIRIS), KU Leuven, Leuven, Belgium
H
Hans Weytjens
Research Centre for Information Systems Engineering (LIRIS), KU Leuven, Leuven, Belgium and School of Computation, Information and Technology, Technical University of Munich (TUM), Munich, Germany
Johannes De Smedt
Johannes De Smedt
KU Leuven
Business Process ModelingProcess MiningDeclarative Business Processes