Lining cracks are among the most prevalent forms of tunnel distress and pose significant threats to tunnel operation and vehicle safety. Segmenting tunnel lining cracks is hindered by complex environmental factors, so local feature extraction alone is insufficient for high segmentation accuracy. To address this issue, this study proposes CGV-Net (CNN, GNN, and ViT networks), a novel tunnel crack segmentation model that integrates convolutional neural networks (CNNs), graph neural networks (GNNs), and Vision Transformers (ViTs). By fostering information exchange among local features, the model better captures the global structural patterns of cracks and improves inference on intricate crack configurations, addressing the challenge of modeling contextual information in crack feature extraction. Additionally, the Detailed-Macro Feature Fusion (DMFF) module performs multi-scale feature integration by combining detailed and coarse-grained features, which mitigates the feature loss incurred during the encoding and decoding stages and further improves segmentation precision. To overcome the limitations of existing public datasets, which often feature a narrow range of crack types and simplistic backgrounds, this study introduces TunnelCrackDB, a dataset encompassing diverse crack types and complex backgrounds. Experimental evaluations on both the public Crack dataset and the newly developed TunnelCrackDB demonstrate the efficacy of CGV-Net. On the Crack dataset, CGV-Net achieves accuracy, recall, and F1 scores of 73.27% and 57.32%, respectively. On TunnelCrackDB, CGV-Net attains accuracy, recall, and F1 scores of 81.15%, 83.54%, and 82.33%, respectively, demonstrating its effectiveness in challenging segmentation tasks.
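For readers who want to relate the reported metrics to one another, the following is a minimal sketch of the standard F1 definition (the harmonic mean of precision and recall). It is not taken from the paper's code. The function name `f1_score` is hypothetical, and treating the first TunnelCrackDB value (81.15%) as precision is an assumption made only for this illustration.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0  # convention: F1 is 0 when both inputs are 0
    return 2 * precision * recall / (precision + recall)


# TunnelCrackDB figures from the abstract.
# Assumption: the first reported value (81.15%) is read as precision.
tunnel_f1 = f1_score(0.8115, 0.8354)
print(f"TunnelCrackDB F1 under this reading: {tunnel_f1:.2%}")
```

Under that reading, the computed value agrees with the reported 82.33% to two decimal places.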