Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography

📅 2025-01-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In pulsed thermography (PTI) non-destructive testing, conventional principal component analysis (PCA) and thermal signal reconstruction (TSR) feature representations suffer from inadequate subsurface defect segmentation and depth estimation due to their decoupled, unimodal nature. To address this, we propose PT-Fusion, a multimodal attention-based fusion network. PT-Fusion introduces two novel components: the Encoder Attention Fusion Gate (EAFG), enabling adaptive weighted fusion of PCA and TSR features within the encoder, and the Attention-Enhanced Decoder Block (AEDB), facilitating cross-modal interaction and refinement in the decoder. Additionally, a thermal sequence random sampling data augmentation strategy is incorporated to mitigate small-sample limitations. Built upon a U-Net variant architecture, PT-Fusion achieves ~10% improvement in both defect segmentation and depth estimation accuracy over U-Net, Attention U-Net, and 3D-CNN on the Laval IRT-PVC dataset, significantly enhancing localization precision and quantitative assessment capability for internal defects in industrial components.

Technology Category

Application Category

📝 Abstract
AI-driven pulse thermography (PT) has become a crucial tool in non-destructive testing (NDT), enabling automatic detection of hidden anomalies in various industrial components. Current state-of-the-art techniques feed segmentation and depth estimation networks compressed PT sequences using either Principal Component Analysis (PCA) or Thermographic Signal Reconstruction (TSR). However, treating these two modalities independently constrains the performance of PT inspection models as these representations possess complementary semantic features. To address this limitation, this work proposes PT-Fusion, a multi-modal attention-based fusion network that fuses both PCA and TSR modalities for defect segmentation and depth estimation of subsurface defects in PT setups. PT-Fusion introduces novel feature fusion modules, Encoder Attention Fusion Gate (EAFG) and Attention Enhanced Decoding Block (AEDB), to fuse PCA and TSR features for enhanced segmentation and depth estimation of subsurface defects. In addition, a novel data augmentation technique is proposed based on random data sampling from thermographic sequences to alleviate the scarcity of PT datasets. The proposed method is benchmarked against state-of-the-art PT inspection models, including U-Net, attention U-Net, and 3D-CNN on the Universit'e Laval IRT-PVC dataset. The results demonstrate that PT-Fusion outperforms the aforementioned models in defect segmentation and depth estimation accuracies with a margin of 10%.
Problem

Research questions and friction points this paper is trying to address.

Pulse Thermography
Image Processing
Non-Destructive Testing
Innovation

Methods, ideas, or system contributions that make the work stand out.

PT-Fusion
Attention-based Feature Fusion
Random Heat Image Selection Strategy
🔎 Similar Papers
No similar papers found.
Mohammed Salah
Mohammed Salah
PhD Candidate, Khalifa University
Neuromorphic visionRobotic perceptionNon-Destructive TestingInfrared Thermography
N
N. Werghi
Department of Computer Science, Khalifa University, Abu Dhabi, UAE
D
D. Svetinovic
Department of Computer Science, Khalifa University, Abu Dhabi, UAE
Y
Y. Abdulrahman
Department of Aerospace Engineering, Khalifa University, Abu Dhabi, UAE