DCUFormer: Enhancing pavement crack segmentation in complex environments with dual-cross/upsampling attention

Jinhuan Shan,Yue Huang,Wei Jiang

doi:10.1016/j.eswa.2024.125891

Abstract

Efficient road inspection and maintenance are essential to extend pavement lifespan and enhance safety. However, automated crack detection remains challenging due to varied environmental conditions and differences in image collection equipment, making robust algorithm development a critical need. Vision Transformers, with their capacity to capture long-range dependencies, offer significant advantages for crack detection in complex scenarios by effectively extracting global features. Nevertheless, existing Transformer-based methods encounter difficulties in boundary delineation due to decoder design limitations, which lead to suboptimal fusion of low-level and high-level features. To address this issue, we propose a comprehensive approach that integrates semantic preservation, detail refinement, and detail delineation. These concepts are realized through our novel Dual-Cross Attention Module (DCA) and Upsampling Attention Module (UA). The DCA module progressively filters redundant details from low-level feature layers using high-level semantic information, while preserving boundary details to refine high-level feature boundaries. In addition, the UA module employs progressive local cross-attention in upsampling, facilitating more precise boundary definitions and surpassing conventional dynamic upsampling methods. Our approach, utilizing both lightweight (MiT-B0, LVT) and middle-weight (Swin-T) backbones, demonstrates state-of-the-art performance on three diverse datasets—Crack500, CrackSC, and UAV-Crack500—highlighting its robustness across varied conditions. This work contributes to advancing Transformer-based architectures for defect segmentation in complex engineering contexts, underscoring the critical role of improved feature fusion in crack detection. The code is available at: https://github.com/SHAN-JH/DCUFormer.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DCUFormer: Enhancing pavement crack segmentation in complex environments with dual-cross/upsampling attention

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications

Lead the way for us

Similar Papers

An image retrieval technique based on texture features using semantic properties
K.P Jisha ... Bella Mary I Thusnavis
-
K.P Jisha, et. al.K.P Jisha ... Bella Mary I Thusnavis
01 Feb 2013
01 Feb 2013

Enhancing feature fusion for human pose estimation
Rui Wang ... Jiangwei Tong
Machine Vision and Applications | VOL. 31
Rui Wang, et. al.Rui Wang ... Jiangwei Tong
24 Sep 2020
Machine Vision and Applications | VOL. 31

A Deeply Supervised Convolutional Neural Network for Pavement Crack Detection With Multiscale Feature Fusion.
Zhong Qu ... Chong Cao
IEEE transactions on neural networks and learning systems | VOL. 33
Zhong Qu, et. al.Zhong Qu ... Chong Cao
15 Mar 2021
IEEE transactions on neural networks and learning systems | VOL. 33

LLAM-MDCNet for Detecting Remote Sensing Images of Dead Tree Clusters
Zongchen Li ... Ruoli Yang
Remote Sensing | VOL. 14
Zongchen Li, et. al.Zongchen Li ... Ruoli Yang
01 Aug 2022
Remote Sensing | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DCUFormer: Enhancing pavement crack segmentation in complex environments with dual-cross/upsampling attention

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications