An Uncertainty-Aware ED-LSTM for Probabilistic Suffix Prediction

📅 2025-05-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the limitation of deterministic suffix prediction in business process mining—its inability to capture inherent future uncertainty—this paper proposes the first probabilistic suffix prediction framework, which models the full probability distribution over possible suffix sequences. Methodologically, we introduce the U-ED-LSTM architecture, the first to jointly integrate Monte Carlo Dropout (to quantify epistemic uncertainty) and learnable loss attenuation (to model aleatoric uncertainty), and propose a Monte Carlo suffix sampling algorithm enabling calibrated probabilistic predictions. Experiments on four real-world event logs demonstrate that the framework’s probabilistic prediction mean significantly outperforms conventional most-likely-suffix baselines—especially under sparse prefixes and long suffixes—while its uncertainty estimates exhibit strong calibration, as validated by standard metrics such as expected calibration error.

Technology Category

Application Category

📝 Abstract
Suffix prediction of business processes forecasts the remaining sequence of events until process completion. Current approaches focus on predicting a single, most likely suffix. However, if the future course of a process is exposed to uncertainty or has high variability, the expressiveness of a single suffix prediction can be limited. To address this limitation, we propose probabilistic suffix prediction, a novel approach that approximates a probability distribution of suffixes. The proposed approach is based on an Uncertainty-Aware Encoder-Decoder LSTM (U-ED-LSTM) and a Monte Carlo (MC) suffix sampling algorithm. We capture epistemic uncertainties via MC dropout and aleatoric uncertainties as learned loss attenuation. This technical report provides a detailed evaluation of the U-ED-LSTM's predictive performance and assesses its calibration on four real-life event logs with three different hyperparameter settings. The results show that i) the U-ED-LSTM has reasonable predictive performance across various datasets, ii) aggregating probabilistic suffix predictions into mean values can outperform most likely predictions, particularly for rare prefixes or longer suffixes, and iii) the approach effectively captures uncertainties present in event logs.
Problem

Research questions and friction points this paper is trying to address.

Predicts remaining business process events probabilistically
Addresses uncertainty in process suffix prediction
Combines epistemic and aleatoric uncertainty modeling
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Uncertainty-Aware Encoder-Decoder LSTM
Incorporates Monte Carlo suffix sampling
Learns epistemic and aleatoric uncertainties
🔎 Similar Papers
No similar papers found.
H
Henryk Mustroph
Technical University of Munich, TUM School of Computation, Information and Technology, Garching, Germany
M
Michel Kunkler
Technical University of Munich, TUM School of Computation, Information and Technology, Garching, Germany
Stefanie Rinderle-Ma
Stefanie Rinderle-Ma
Full Professor, Technical University of Munich, Department of Informatics
Information SystemsBusiness Process ManagementProcess-Aware Information SystemsBusiness Intelligence