🤖 AI Summary
This study addresses the problem of enhancing interpretability in neural networks for writer identification (WI) and writer verification (WV), supporting forensic experts’ understanding of model decisions and improving system reliability. Methodologically, it introduces— for the first time in this domain—two complementary interpretability techniques: pixel-level saliency maps (e.g., Grad-CAM) to localize discriminative regions in handwriting samples, and pointwise saliency maps to quantify similarity between textual and graphical features. A dual-path evaluation framework is proposed, integrating qualitative assessment (comparison against expert-annotated ground truth) with quantitative metrics (deletion and insertion scores). Experimental results demonstrate that pixel-level methods significantly outperform pointwise approaches in discriminative region localization accuracy, deletion/insertion scores, and visual consistency with forensic expert judgments. These findings establish a trustworthy, forensically grounded foundation for explainable AI in handwriting analysis.
📝 Abstract
Neural Networks are the state of the art for many tasks in the computer vision domain, including Writer Identification (WI) and Writer Verification (WV). The transparency of these"black box"systems is important for improvements of performance and reliability. For this work, two transparency techniques are applied to neural networks trained on WI and WV for the first time in this domain. The first technique provides pixel-level saliency maps, while the point-specific saliency maps of the second technique provide information on similarities between two images. The transparency techniques are evaluated using deletion and insertion score metrics. The goal is to support forensic experts with information on similarities in handwritten text and to explore the characteristics selected by a neural network for the identification process. For the qualitative evaluation, the highlights of the maps are compared to the areas forensic experts consider during the identification process. The evaluation results show that the pixel-wise saliency maps outperform the point-specific saliency maps and are suitable for the support of forensic experts.