๐ค AI Summary
Existing work primarily focuses on action-set extraction, neglecting end-to-end routine model discovery and lacking validation on real-world UI logs corrupted by execution variability and human errors. This paper proposes a noise-tolerant clustering method that, for the first time, directly enables complete and high-precision extraction of routine logs from raw UI interaction tracesโthereby facilitating routine pattern discovery in robotic process automation (RPA). Our approach integrates behavioral similarity measurement with an adaptive noise-filtering mechanism to robustly identify and reconstruct routine execution paths. Extensive experiments across nine publicly available UI log datasets demonstrate that our method achieves an average F1-score improvement of over 15% under high-noise conditions, significantly outperforming state-of-the-art techniques. The key contributions include: (i) the first end-to-end routine discovery framework tailored for noisy UI logs; (ii) a principled noise-resilient clustering strategy grounded in behavioral semantics; and (iii) empirically validated superiority in both accuracy and robustness.
๐ Abstract
Robotic Process Mining focuses on the identification of the routine types performed by human resources through a User Interface. The ultimate goal is to discover routine-type models to enable robotic process automation. The discovery of routine-type models requires the provision of a routine log. Unfortunately, the vast majority of existing works do not directly focus on enabling the model discovery, limiting themselves to extracting the set of actions that are part of the routines. They were also not evaluated in scenarios characterized by inconsistent routine execution, hereafter referred to as noise, which reflects natural variability and occasional errors in human performance. This paper presents a clustering-based technique that aims to extract routine logs. Experiments were conducted on nine UI logs from the literature with different levels of injected noise. Our technique was compared with existing techniques, most of which are not meant to discover routine logs but were adapted for the purpose. The results were evaluated through standard state-of-the-art metrics, showing that we can extract more accurate routine logs than what the state of the art could, especially in the presence of noise.