🤖 AI Summary
To address low accuracy in serial crime linkage caused by high-dimensional, sparse, and heterogeneous crime data, this paper proposes a Siamese autoencoder model integrated with geographic-temporal features. Innovatively, geographic-temporal information is incorporated into the decoder stage—rather than the input layer—to prevent dilution of behavioral signals by sparse raw features. A domain-knowledge-guided dimensionality reduction strategy further enhances representation interpretability and practical utility. The model employs contrastive learning to capture behavioral similarity between crime instances and adopts binary encoding to strengthen discriminative capability. Experiments on the real-world forensic dataset ViCLAS demonstrate a 9% improvement in AUC over conventional methods, significantly boosting serial crime association accuracy. This work provides a robust technical foundation for intelligent crime linkage analysis and investigative decision support.
📝 Abstract
Effective crime linkage analysis is crucial for identifying serial offenders and enhancing public safety. To address limitations of traditional crime linkage methods in handling high-dimensional, sparse, and heterogeneous data, we propose a Siamese Autoencoder framework that learns meaningful latent representations and uncovers correlations in complex crime data. Using data from the Violent Crime Linkage Analysis System (ViCLAS), maintained by the Serious Crime Analysis Section of the UK's National Crime Agency, our approach mitigates signal dilution in sparse feature spaces by integrating geographic-temporal features at the decoder stage. This design amplifies behavioral representations rather than allowing them to be overshadowed at the input level, yielding consistent improvements across multiple evaluation metrics. We further analyze how different domain-informed data reduction strategies influence model performance, providing practical guidance for preprocessing in crime linkage contexts. Our results show that advanced machine learning approaches can substantially enhance linkage accuracy, improving AUC by up to 9% over traditional methods while offering interpretable insights to support investigative decision-making.