Performance Analysis: Discovering Semi-Markov Models From Event Logs

📅 2022-06-29
🏛️ IEEE Access
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the challenge of efficiently and analytically modeling process execution time statistics from event logs. Methodologically, it introduces the first end-to-end analytical performance analysis framework based on semi-Markov processes: it directly infers execution time means and probability density functions (PDFs) from logs—bypassing simulation entirely. For discrete-time execution times, it employs exact convolution; for continuous-time cases, it approximates PDFs using Gaussian mixture models (GMMs), balancing accuracy, model compactness, and interpretability. Experiments show that the discrete-time approach achieves up to one order of magnitude speedup over simulation under small support sets, while GMM-based representation drastically reduces model size, enabling rapid what-if analysis. The core contribution is the first fully analytical, log-driven inference of semi-Markov performance models—eliminating reliance on traditional simulation-based approaches and establishing a new paradigm for scalable, interpretable process performance analysis.
📝 Abstract
Process mining is a well-established discipline of data analysis focused on the discovery of process models from information systems’ event logs. Recently, an emerging subarea of process mining, known as stochastic process discovery, has started to evolve. Stochastic process discovery considers frequencies of events in the event data and allows for a more comprehensive analysis. In particular, when the durations of activities are presented in the event log, performance characteristics of the discovered stochastic models can be analyzed, e.g., the overall process execution time can be estimated. Existing performance analysis techniques usually discover stochastic process models from event data, and then simulate these models to evaluate their execution times. These methods rely on empirical approaches. This paper proposes analytical techniques for performance analysis that allow for the derivation of statistical characteristics of the overall processes’ execution times in the presence of arbitrary time distributions of events modeled by semi-Markov processes. The proposed methods include express analysis, focused on the mean execution time estimation, and full analysis techniques that build probability density functions (PDFs) of process execution times in both continuous and discrete forms. These methods are implemented and tested on real-world event data, demonstrating their potential for what-if analysis by providing solutions without resorting to simulation. Specifically, we demonstrated that the discrete approach is more time-efficient for small duration support sizes compared to the simulation technique. Furthermore, we showed that the continuous approach, with PDFs represented as Mixtures of Gaussian Models (GMMs), facilitates the discovery of more compact and interpretable models.
Problem

Research questions and friction points this paper is trying to address.

Develops analytical techniques for performance analysis using semi-Markov processes.
Estimates mean execution time and builds probability density functions for process execution.
Provides efficient, simulation-free solutions for what-if analysis in process mining.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Analytical techniques for performance analysis
Semi-Markov processes model arbitrary time distributions
Mixtures of Gaussian Models for compact PDFs
A
A. Kalenkova
Adelaide Data Science Centre (ADSC), School of Computer and Mathematical Sciences, The University of Adelaide, North Terrace Campus, Adelaide, 5000, South Australia, Australia
Lewis Mitchell
Lewis Mitchell
Professor of Data Science, University of Adelaide
online social networkscomputational social sciencedata sciencecomplex systemsdata assimilation
M
M. Roughan
Adelaide Data Science Centre (ADSC), School of Computer and Mathematical Sciences, The University of Adelaide, North Terrace Campus, Adelaide, 5000, South Australia, Australia