Abstract

AbstractDifferent from most current studies using convolutional neural network (CNN), a deep learning detection method for steel plate surface defects based on Transformer is researched. This paper presents a process, network structure and detection method for steel strip surface defect detection. A Swin Transformer was used to extract hierarchical features in the detection system. Then feature pyramid networks (FPN) was used to fuse the above features to form multi‐scale feature maps, and region proposal network (RPN) was adopted to generate the generate region of interest (ROI) of defects. Finally, the ROI head was used to generate defect category information and its precise location. Here, ablation experiments were conducted to explore the impact of different backbone networks and the number of stages of Swin Transformer and FPN on target detection performance to prove the rationality and efficiency of the network. On the NEU‐DET dataset, the proposed algorithm achieved 81.1% mAP, which is higher than that of classic CNN detection methods such as Faster R‐CNN, SSD, Yolo v3 and RepPoints.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call