Abstract

Statistical texture features are essential for metal surface defect detection. However, low contrast and cluttered backgrounds exacerbate the intrinsic blurriness and variability of defect textures, hampering accurate identification and localization. To tackle these challenges, we propose a Statistical Texture Feature Enhancement Network (STFE-Net) based on YOLO model, a multi-stage approach based on pooling attention mechanism and cross-scale shallow feature reinforcement. Firstly, we enhance the image contrast and propose a Stacked Pooling Attention Residual (SPA-Res) module to capture better texture features; then, we embed the Shallow Feature Weighted (SFW) branches to achieve cross-layer shallow feature reinforcement and add feature layer (AFL) to extract rich semantic features; and finally, we propose an Efficient Complete Intersection over Union (ECIoU) loss function and employ four detection heads to achieve precise localization of defects. On the NEU-DET dataset, our model achieves the state-of-the-art performance. Especially, the mean Average Precision (mAP@IoU = 0.5) of our model is 82.3 %, which is 6.2 % higher than baseline network. With the input resolution of 640 × 640, STFE-Net achieves 54 FPS inference speed with a GeForce GTX 1080. In addition, the experimental results on the DAGM dataset and GC10-DET dataset demonstrate that the proposed model has excellent generalization ability.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call