Abstract

Cracks serve as a significant indicator of aging infrastructure. While the surfaces of traditional infrastructures are visually inspected manually, this approach is labor-intensive, highly subjective, and often surpasses the capabilities of available inspection personnel. Although some researchers have employed traditional image processing and machine learning techniques to address these challenges, the presence of irregular crack shapes, complex lighting conditions, and limitations in detecting only a single type of surface crack complicate automatic crack detection. Recent research indicates that deep learning methods are increasingly leading in image-based feature extraction, object detection, and pixel attribute analysis. This paper introduces a method, Crack_PSTU (Pre-trained Swin Transformer U-Net), which leverages the U-Net framework and Swin Transformer model for infrastructure crack classification and detection tasks. Specifically, we utilize a pre-trained Swin Transformer network as the encoder and implement the decoder using convolution, pooling, and other operations, creating a "U"-shaped model architecture. Our experiments encompassed 11,298 crack images from varied scenes, with 9603 designated for training and 1695 for validation. The findings reveal that our approach surpasses other algorithms for this dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.