🤖 AI Summary
Reliable detection of minute transient rail breaks (2–10 cm) in onboard intelligent Structural Health Monitoring (iSHM) remains challenging due to strong vehicle dynamics, high-frequency local noise, and severe scarcity of labeled data.
Method: This paper proposes an unsupervised deep learning framework featuring a self-attention–based Transformer architecture, which innovatively leverages attention weight deviation to generate interpretable anomaly scores; an incremental synthetic dataset benchmark is constructed to systematically evaluate robustness under varying speeds, multi-channel inputs, and noise perturbations; and reconstruction loss drives training for precise anomaly localization.
Results: Experiments demonstrate state-of-the-art detection accuracy, significantly improved inference efficiency, and reveal high-frequency local noise as a critical bottleneck for practical iSHM deployment.
📝 Abstract
Indirect structural health monitoring (iSHM) for broken rail detection using onboard sensors presents a cost-effective paradigm for railway track assessment, yet reliably detecting small, transient anomalies (2-10 cm) remains a significant challenge due to complex vehicle dynamics, signal noise, and the scarcity of labeled data limiting supervised approaches. This study addresses these issues through unsupervised deep learning. We introduce an incremental synthetic data benchmark designed to systematically evaluate model robustness against progressively complex challenges like speed variations, multi-channel inputs, and realistic noise patterns encountered in iSHM. Using this benchmark, we evaluate several established unsupervised models alongside our proposed Attention-Focused Transformer. Our model employs a self-attention mechanism, trained via reconstruction but innovatively deriving anomaly scores primarily from deviations in learned attention weights, aiming for both effectiveness and computational efficiency. Benchmarking results reveal that while transformer-based models generally outperform others, all tested models exhibit significant vulnerability to high-frequency localized noise, identifying this as a critical bottleneck for practical deployment. Notably, our proposed model achieves accuracy comparable to the state-of-the-art solution while demonstrating better inference speed. This highlights the crucial need for enhanced noise robustness in future iSHM models and positions our more efficient attention-based approach as a promising foundation for developing practical onboard anomaly detection systems.