Abstract Steel is one of the most common and widely used materials in modern industrial production, and has been widely and deeply used in the field of engineering construction. However, the manufacturing process and external factors can lead to defect problems that seriously affect the quality and appearance of the steel. Defects have the characteristics of multi-scale, weak texture, many dense and small defects, and interference in the processing environment, which are challenging for the actual location and classification of defects. The detection of these surface defects is challenging due to their multi-scale nature, weak textures, numerous small defects, and complex background interference. To address these challenges, an EMC-YOLO algorithm is proposed to detect defects accurately. First, an Enhanced Fast Feature Extraction (EFFE) module is constructed. It integrates local saliency information with global saliency information and achieves multi-depth feature fusion. The EFFE replaces the bottleneck structure of C2F, enhancing feature extraction capabilities. Secondly, to optimize multi-scale defect feature detection for small and elongated defects, a Multi-Scale Receptive Field Spatial. Pooling Fast Pyramid (MRF-SPPF) module is proposed. Finally, Cross-Reinforced Connections Across Spatial Features (CCASF) is designed to give full play to the feature representation extracted after backbone network enhancement and deeply integrate it with neck features. This closely links the enhanced backbone features with the enriched neck features. On the NEU-DET dataset, the mAP value of the proposed model is improved by 3.5%, and the accuracy of the model finally reaches 80.7%. At the same time, in another GC10-DET dataset, our model also achieved 73.5% mAP value. It achieves good accuracy while satisfying real-time performance. It has a good application prospect in the actual strip processing.
Read full abstract