Abstract Concrete surface crack detection and maintenance are crucial for ensuring structural safety. Deep learning-based techniques for detecting concrete cracks have become popular due to the quick advancement of artificial intelligence. However, the actual uses of these methods are limited due to issues like large model sizes and significant dependence on powerful computing hardware. To address these issues, this paper presents a lightweight multi-scale encoder-decoder network (LMED-Net) for crack detection of concrete structures. LMED-Net employs MobileNetV2 as the encoder for the initial feature extraction. A multi-scale feature extraction (MFE) module is developed and serially attached after the encoder for refining feature extraction. Finally, to strengthen the network's perception of pixels surrounding the cracks, a novel enhanced attention mechanism (EAM) is deployed in the decoder. By improving the network's attention to information within the crack regions, this mechanism keeps contextual information from being lost. Comparative experimental results show that the proposed network achieves an F1 Score(F1) of 60.32% and a mean intersection over union (mIoU) of 71.04% on the CFD dataset. On the DeepCrack dataset, the F1 and mIoU increase to 79.09% and 81.85% respectively. Notably, LMED-Net performs exceptionally well in crack segmentation since its model size and parameters count are much smaller than those of other image segmentation methods. Furthermore, ablation studies further validate the effectiveness of the proposed MFE module and EAM.
Read full abstract