Abstract

Infrared dim and small target detection is widely used in military and civil fields. Traditional methods in that application rely on the local contrast between the target and background for single-frame detection. On the other hand, those algorithms depend on the motion model with fixed parameters for multi-frame association. For the great similarity of gray value and the dynamic changes of motion model parameters in the condition of low SNR and strong clutter, those methods possess weak robustness, low detection probability, and high false alarm rate. In this paper, an infrared video sequences encoding and decoding model based on Bidirectional Convolutional Long Short-Term Memory structure (Bi-Conv-LSTM) and 3D Convolutional structure (3D-Conv) is proposed, addressing the problem of high similarity and dynamic changes of parameters. For solving the problem of dynamic change in parameters, Bi-Conv-LSTM structure is used to learn the motion model of targets. And for the problem of low local contrast, 3D-Conv structure is adopted to extend receptive field in the time dimension. In order to improve the precision of detection, the Decoding part is divided into two different full connections with distinctive active function. Simulation results show that the trajectory detection accuracy of the proposed model is more than 90% under the condition of low SNR and maneuvering motion, which is better than traditional method of 80% in DB-TBD 20% in others. Real data experiment to illustrate that that our proposed method can detect small infrared targets of a low false alarm rate and high detection probability.

Highlights

  • With the advantages of strong concealment and wide field of view, space-based infrared detection system plays a great role in the military and civil fields, and has been widely concerned by academia and industry

  • We propose a Encoding-Decoding model of infrared video sequences is proposed: Bi-Conv-LSTM structure is used for addressing the problem of dynamic change of parameters and 3D Convolutional structure (3D-Conv) structure is used for encoding video sequences into highdimensional feature vector, and two-way full connection (FC) structure is used for calculating the target positions and confidence simultaneously

  • In order to reflect the effectiveness of the proposed model, the model proposed in this paper has good we compare dynamic programming algorithm (DP-Tracking Before Detection algorithm (TBD)) [11], generalization performance, and its prediction results has high time variance filter algorithm (TVF-TBD) [9], Kalman credibility

Read more

Summary

INTRODUCTION

With the advantages of strong concealment and wide field of view, space-based infrared detection system plays a great role in the military and civil fields, and has been widely concerned by academia and industry. To introduce the time dimension information, a time variance filter (TVF) algorithm [9] for multi-frame sequences is proposed by Lvping yue et al, which focuses on the intensity distribution of grayscale values of each pixel in time and analyze the characteristics respectively It would cause missed detection and a large number of false alarms when the targets have not strong gray value. In order to extract spatial-temporal information at the same time, Sinn Y U et al [16] uses 3DConv to conduct time-space joint detection of multi-frame images This method introduces 3D-Conv structure to process multi-frame infrared images for the first time.

Method
LOSS FUNCTION
EXPERIMENTS UNDER DIFFERENT MOTION
Findings
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.