This study proposed an interpretable multi-scale infrared small object detection network (IMD-Net) design method to improve the precision of infrared small object detection and contour segmentation in complex backgrounds. To this end, a multi-scale object enhancement module was constructed, which converted artificially designed features into network structures. The network structure was used to enhance actual objects and extract shallow detail and deep semantic features of images. Next, a global object response, channel attention, and multilayer feature fusion modules were introduced, combining context and channel information and aggregated information, selected data, and decoded objects. Finally, the multiple loss constraint module was constructed, which effectively constrained the network output using multiple losses and solved the problems of high false alarms and high missed detections. Experimental results showed that the proposed network model outperformed local energy factor (LEF), self-regularized weighted sparse model (SRWS), asymmetric contextual modulation (ACM), and other state of the art methods in the intersection-over-union (IoU) and Fmeasure values by 10.8% and 11.3%, respectively. The proposed method performed best on the currently available datasets, achieving accurate detection and effective segmentation of dim and small objects in various infrared complex background images.
Read full abstract