FlexiCrackNet: A Flexible Pipeline for Enhanced Crack Segmentation with General Features Transfered from SAM

📅 2025-01-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the low segmentation accuracy and poor generalization of existing methods on low-performance devices—particularly under challenging conditions such as blurry images, complex backgrounds, and faint cracks—this paper proposes a lightweight and robust end-to-end crack segmentation framework. Methodologically, it decouples universal feature extraction (via an enhanced EdgeSAM encoder that removes fixed-resolution constraints) from task-specific decoding, and introduces the Information-Gated Attention Mechanism (IGAM), a novel module enabling adaptive multi-scale feature fusion. Contributions include: (1) the first crack segmentation framework supporting zero-shot cross-domain generalization at arbitrary input resolutions; (2) state-of-the-art performance across multiple benchmarks; (3) efficient inference with low GPU memory footprint; and (4) superior robustness against image blur, noise, and visual ambiguity.

Technology Category

Application Category

📝 Abstract
Automatic crack segmentation is a cornerstone technology for intelligent visual perception modules in road safety maintenance and structural integrity systems. Existing deep learning models and ``pre-training + fine-tuning'' paradigms often face challenges of limited adaptability in resource-constrained environments and inadequate scalability across diverse data domains. To overcome these limitations, we propose FlexiCrackNet, a novel pipeline that seamlessly integrates traditional deep learning paradigms with the strengths of large-scale pre-trained models. At its core, FlexiCrackNet employs an encoder-decoder architecture to extract task-specific features. The lightweight EdgeSAM's CNN-based encoder is exclusively used as a generic feature extractor, decoupled from the fixed input size requirements of EdgeSAM. To harmonize general and domain-specific features, we introduce the information-Interaction gated attention mechanism (IGAM), which adaptively fuses multi-level features to enhance segmentation performance while mitigating irrelevant noise. This design enables the efficient transfer of general knowledge to crack segmentation tasks while ensuring adaptability to diverse input resolutions and resource-constrained environments. Experiments show that FlexiCrackNet outperforms state-of-the-art methods, excels in zero-shot generalization, computational efficiency, and segmentation robustness under challenging scenarios such as blurry inputs, complex backgrounds, and visually ambiguous artifacts. These advancements underscore the potential of FlexiCrackNet for real-world applications in automated crack detection and comprehensive structural health monitoring systems.
Problem

Research questions and friction points this paper is trying to address.

Crack Detection
Low Performance Devices
Complex Conditions
Innovation

Methods, ideas, or system contributions that make the work stand out.

FlexiCrackNet
Knowledge Integration
Smart Feature Selection
🔎 Similar Papers
No similar papers found.
X
Xinlong Wan
School of Electronic and Electronics Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
X
Xiaoyan Jiang
School of Electronic and Electronics Engineering, Shanghai University of Engineering Science, Shanghai 201620, China, and also with Leiden University, Leiden, EZ 2311, The Netherlands
G
Guangsheng Luo
School of Electronic and Electronics Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
Ferdous Sohel
Ferdous Sohel
Murdoch University
Computer VisionMachine LearningDigital AgricultureMedical Imaging and Health Analytics
J
Jenq-Neng Hwang
Department of Electrical Engineering, University of Washington, Seattle, WA 98105 USA