Many tomb murals have punctate losses, cracks, and craquelure due to underground subsidence and changes in their physical support. Visual non-destructive detection techniques enable rapid assessment of how much tomb murals are affected by cracking, providing suggestions for their conservation. However, tomb murals are typically created by sketching outlines and then colored. Detailed sketches can easily interfere with crack detection work, requiring the use of deep learning network to better learn crack features and improve detection accuracy. At the same time the limited data of tomb mural presents a challenge to build a deep learning network. To address these issues, this paper introduces a novel dual-attention detection network (DADNet) for crack segmentation of tomb murals. In this work, a customized dataset is first constructed by collecting mural images from the Tang Dynasty tombs. Then the ConvNeXt framework serves as the basis for feature extraction, enhancing the process. Lastly, a dual-attention module utilizing neighborhood attention and biaxial attention is employed to accurately identify the crack regions. Neighborhood attention performs a local self-attention operation around the pixel point, addressing the limitations of self-attention. This approach significantly reduces computational demands as the image size increases. Biaxial attention performs attention calculations in the horizontal and vertical directions. This compensates for the limitation of neighborhood attention in capturing global dependencies. Our DADNet outperformed the competing methods, achieving the highest recorded scores of 78.95% for MIoU and 61.05% for the Jaccard index.
Read full abstract