Due to the rising cost of human resources in road maintenance and the pursuit of efficiency, autonomous road inspection vehicles are developed for intelligent detection of road disease to prevent severe traffic disasters in the early stages. Nevertheless, as a prevalent road disease, road cracks are diverse and susceptible to shadows, weather changes, and noise in data acquisition. Moreover, they usually appear with thin shapes that are hard to detect correctly by existing methods. To handle this problem, more details of the road cracks need to be better analyzed. In this article, we propose a refined road crack detection method named locally enhanced cross-shaped windows transformer (LECSFormer), which adopts a delicate design of the encoder-decoder structure. The encoder employs window-based transformer blocks to model long-range dependencies. Each transformer block ensembles the locally enhanced module to enrich the local contextual information, and token shuffle operation is applied to build cross-windows connections. The decoder uses dense connections to fuse multi-scale information, and the feature fusion module fuses hierarchical features and reweights them by the channel attention mechanism. The proposed method outperforms other state-of-the-art methods with ODS of 0.963, 0.917, 0.952, and 0.953 on four challenging datasets, CrackTree260, CrackLS315, Stone331, and CRKWH100. It can accurately detect cracks in road surfaces and support intelligent preventive maintenance of roads.
Read full abstract