Coarse-to-fine crack cue for robust crack detection

📅 2025-07-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Weak cross-domain generalization and neglect of crack elongation characteristics hinder existing crack detection models. To address this, we propose a structure-aware coarse-to-fine crack cue generation method leveraging geometric priors. Specifically, we exploit the thin, elongated nature of cracks for the first time: maximum pooling followed by upsampling constructs a coarse crack-free background; a reconstruction network refines this background; and subsequent differencing yields highly discriminative crack cues. The proposed module is plug-and-play and seamlessly integrates into mainstream detection frameworks. Extensive experiments on multiple benchmark datasets demonstrate consistent performance gains across three state-of-the-art models—particularly notable under cross-domain settings, where robustness and stability improve substantially. Our approach establishes a new paradigm for structure-aware generic crack detection.

Technology Category

Application Category

📝 Abstract
Crack detection is an important task in computer vision. Despite impressive in-dataset performance, deep learning-based methods still struggle in generalizing to unseen domains. The thin structure property of cracks is usually overlooked by previous methods. In this work, we introduce CrackCue, a novel method for robust crack detection based on coarse-to-fine crack cue generation. The core concept lies on leveraging the thin structure property to generate a robust crack cue, guiding the crack detection. Specifically, we first employ a simple max-pooling and upsampling operation on the crack image. This results in a coarse crack-free background, based on which a fine crack-free background can be obtained via a reconstruction network. The difference between the original image and fine crack-free background provides a fine crack cue. This fine cue embeds robust crack prior information which is unaffected by complex backgrounds, shadow, and varied lighting. As a plug-and-play method, we incorporate the proposed CrackCue into three advanced crack detection networks. Extensive experimental results demonstrate that the proposed CrackCue significantly improves the generalization ability and robustness of the baseline methods. The source code will be publicly available.
Problem

Research questions and friction points this paper is trying to address.

Improves crack detection generalization across unseen domains
Leverages thin structure property for robust crack cues
Addresses challenges from complex backgrounds and lighting variations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Coarse-to-fine crack cue generation
Max-pooling and upsampling operations
Plug-and-play reconstruction network integration
🔎 Similar Papers
No similar papers found.
Z
Zelong Liu
National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China; Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China; Hubei Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University, Wuhan, China
Y
Yuliang Gu
National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China; Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China; Hubei Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University, Wuhan, China
Z
Zhichao Sun
National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China; Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China; Hubei Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University, Wuhan, China
H
Huachao Zhu
National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China; Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China; Hubei Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University, Wuhan, China
Xin Xiao
Xin Xiao
ByteDance Research
VLAVLM
Bo Du
Bo Du
Department of Management, Griffith Business School
Sustainable TransportTravel BehaviourUrban Data AnalyticsLogistics and Supply Chain
Laurent Najman
Laurent Najman
Professor, Laboratoire d'Informatique Gaspard Monge, ESIEE, Université Gustave Eiffel
Computer visionImage processing
Y
Yongchao Xu
National Engineering Research Center for Multimedia Software, Wuhan University, Wuhan, China; Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China; Hubei Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University, Wuhan, China