Towards Reproducibility in Predictive Process Mining: SPICE - A Deep Learning Library

📅 2025-12-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing predictive process mining (PPM) research suffers from critical challenges: poor model reproducibility, inconsistent evaluation protocols, difficulty integrating new data, and a lack of standardized cross-method comparison. To address these issues, we introduce SPICE—the first open-source PyTorch-based benchmarking framework specifically designed for PPM. SPICE uniformly implements three dominant deep learning architectures—LSTM, GRU, and Transformer—with standardized data preprocessing, configurable training pipelines, and unified evaluation interfaces. Its modular design ensures reproducible, extensible modeling for both key performance indicator (KPI) prediction and behavioral process modeling. We rigorously reproduce experiments across 11 real-world process event logs, confirming consistency with original reported results while enabling fairer, more transparent cross-architecture comparisons. SPICE significantly advances PPM research by enhancing reproducibility, transparency, and methodological comparability.

Technology Category

Application Category

📝 Abstract
In recent years, Predictive Process Mining (PPM) techniques based on artificial neural networks have evolved as a method for monitoring the future behavior of unfolding business processes and predicting Key Performance Indicators (KPIs). However, many PPM approaches often lack reproducibility, transparency in decision making, usability for incorporating novel datasets and benchmarking, making comparisons among different implementations very difficult. In this paper, we propose SPICE, a Python framework that reimplements three popular, existing baseline deep-learning-based methods for PPM in PyTorch, while designing a common base framework with rigorous configurability to enable reproducible and robust comparison of past and future modelling approaches. We compare SPICE to original reported metrics and with fair metrics on 11 datasets.
Problem

Research questions and friction points this paper is trying to address.

Addresses reproducibility issues in predictive process mining
Enhances transparency and usability for novel datasets
Enables robust comparison of deep learning methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep learning library for predictive process mining
PyTorch framework for reproducible baseline methods
Configurable base enabling robust model comparison
🔎 Similar Papers
No similar papers found.
O
Oliver Stritzel
Process Intelligence, Center for Applied Research on Supply Chain Services, Fraunhofer IIS, Nuremberg, Germany
N
Nick Hühnerbein
Process Intelligence, Center for Applied Research on Supply Chain Services, Fraunhofer IIS, Nuremberg, Germany
Simon Rauch
Simon Rauch
Research Associate, LMU Munich, Fraunhofer IIS
Data ScienceMachine LearningPredictive Process Analytics
I
Itzel Zarate
Process Intelligence, Center for Applied Research on Supply Chain Services, Fraunhofer IIS, Nuremberg, Germany
L
Lukas Fleischmann
Process Intelligence, Center for Applied Research on Supply Chain Services, Fraunhofer IIS, Nuremberg, Germany
M
Moike Buck
Process Intelligence, Center for Applied Research on Supply Chain Services, Fraunhofer IIS, Nuremberg, Germany
Attila Lischka
Attila Lischka
Chalmers University of Technology
Graph LearningRouting ProblemsProcess Mining
C
Christian Frey
Process Intelligence, Center for Applied Research on Supply Chain Services, Fraunhofer IIS, Nuremberg, Germany; Machine Learning Lab, University of Technology Nuremberg, Germany