AVOCADO: The Streaming Process Mining Challenge

📅 2025-10-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Evaluating algorithm performance in streaming process mining lacks standardization and struggles to jointly optimize accuracy, latency, and robustness across diverse real-world challenges (e.g., out-of-order events, memory constraints, throughput variability). Method: We propose AVOCADO—the first open-source evaluation framework specifically designed for streaming process mining. It decouples the *conceptual layer* (defining evaluation dimensions and semantics) from the *instantiation layer* (implementing concrete datasets and algorithm interfaces), enabling scalable, incremental, and realistic assessment—including handling event reordering, throughput fluctuations, and memory overhead. Contribution/Results: AVOCADO integrates streaming-specific metrics (e.g., MAE, RMSE, end-to-end latency) and provides a unified benchmark with automated evaluation pipelines. Experimental validation across multiple algorithms and scenarios demonstrates its effectiveness, significantly improving reproducibility and community collaboration, and establishing a methodological foundation for advancing streaming process mining.

Technology Category

Application Category

📝 Abstract
Streaming process mining deals with the real-time analysis of streaming data. Event streams require algorithms capable of processing data incrementally. To systematically address the complexities of this domain, we propose AVOCADO, a standardized challenge framework that provides clear structural divisions: separating the concept and instantiation layers of challenges in streaming process mining for algorithm evaluation. The AVOCADO evaluates algorithms on streaming-specific metrics like accuracy, Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Processing Latency, and robustness. This initiative seeks to foster innovation and community-driven discussions to advance the field of streaming process mining. We present this framework as a foundation and invite the community to contribute to its evolution by suggesting new challenges, such as integrating metrics for system throughput and memory consumption, and expanding the scope to address real-world stream complexities like out-of-order event arrival.
Problem

Research questions and friction points this paper is trying to address.

Standardizing algorithm evaluation for streaming process mining challenges
Assessing streaming algorithms using accuracy and latency metrics
Addressing real-world stream complexities like out-of-order events
Innovation

Methods, ideas, or system contributions that make the work stand out.

Standardized challenge framework for streaming process mining
Separates concept and instantiation layers for evaluation
Evaluates algorithms using streaming-specific performance metrics
🔎 Similar Papers
No similar papers found.