Accurate and Noise-Tolerant Extraction of Routine Logs in Robotic Process Automation (Extended Version)

📅 2025-10-09

📈 Citations: 0

✨ Influential: 0

career value

217K/year

🤖 AI Summary

Existing work primarily focuses on action-set extraction, neglecting end-to-end routine model discovery and lacking validation on real-world UI logs corrupted by execution variability and human errors. This paper proposes a noise-tolerant clustering method that, for the first time, directly enables complete and high-precision extraction of routine logs from raw UI interaction traces—thereby facilitating routine pattern discovery in robotic process automation (RPA). Our approach integrates behavioral similarity measurement with an adaptive noise-filtering mechanism to robustly identify and reconstruct routine execution paths. Extensive experiments across nine publicly available UI log datasets demonstrate that our method achieves an average F1-score improvement of over 15% under high-noise conditions, significantly outperforming state-of-the-art techniques. The key contributions include: (i) the first end-to-end routine discovery framework tailored for noisy UI logs; (ii) a principled noise-resilient clustering strategy grounded in behavioral semantics; and (iii) empirically validated superiority in both accuracy and robustness.

Technology Category

Application Category

📝 Abstract

Robotic Process Mining focuses on the identification of the routine types performed by human resources through a User Interface. The ultimate goal is to discover routine-type models to enable robotic process automation. The discovery of routine-type models requires the provision of a routine log. Unfortunately, the vast majority of existing works do not directly focus on enabling the model discovery, limiting themselves to extracting the set of actions that are part of the routines. They were also not evaluated in scenarios characterized by inconsistent routine execution, hereafter referred to as noise, which reflects natural variability and occasional errors in human performance. This paper presents a clustering-based technique that aims to extract routine logs. Experiments were conducted on nine UI logs from the literature with different levels of injected noise. Our technique was compared with existing techniques, most of which are not meant to discover routine logs but were adapted for the purpose. The results were evaluated through standard state-of-the-art metrics, showing that we can extract more accurate routine logs than what the state of the art could, especially in the presence of noise.

Problem

Research questions and friction points this paper is trying to address.

Extracting accurate routine logs from user interface interactions

Handling inconsistent routine execution and noise in process data

Enabling robotic process automation through routine-type model discovery

Innovation

Methods, ideas, or system contributions that make the work stand out.

Clustering-based technique extracts routine logs

Method handles inconsistent execution with noise tolerance

Outperforms existing techniques in noisy UI logs

🔎 Similar Papers

What Information Contributes to Log-based Anomaly Detection? Insights from a Configurable Transformer-Based Approach