Deep learning-based defect detection methods have gained widespread application in industrial quality inspection. However, limitations such as insufficient sample sizes, low data utilization, and issues with accuracy and speed persist. This paper proposes a semi-supervised semantic segmentation framework that addresses these challenges through perturbation invariance at both the image and feature space. The framework employs diverse perturbation cross-pseudo-supervision to reduce dependency on extensive labeled datasets. Our lightweight method incorporates edge pixel-level semantic information and shallow feature fusion to enhance real-time performance and improve the accuracy of defect edge detection and small target segmentation in industrial inspection. Experimental results demonstrate that the proposed method outperforms the current state-of-the-art (SOTA) semi-supervised semantic segmentation methods across various industrial scenarios. Specifically, our method achieves a mean Intersection over Union (mIoU) 3.11% higher than the SOTA method on our dataset and 4.39% higher on the public KolektorSDD dataset. Additionally, our semantic segmentation network matches the speed of the fastest network, U-net, while achieving a mIoU 2.99% higher than DeepLabv3Plus.
Read full abstract