Abstract
Although compressive measurements save data storage and bandwidth usage, they are difficult to be used directly for target tracking and classification without pixel reconstruction. This is because the Gaussian random matrix destroys the target location information in the original video frames. This paper summarizes our research effort on target tracking and classification directly in the compressive measurement domain. We focus on one particular type of compressive measurement using pixel subsampling. That is, original pixels in video frames are randomly subsampled. Even in such a special compressive sensing setting, conventional trackers do not work in a satisfactory manner. We propose a deep learning approach that integrates YOLO (You Only Look Once) and ResNet (residual network) for multiple target tracking and classification. YOLO is used for multiple target tracking and ResNet is for target classification. Extensive experiments using short wave infrared (SWIR), mid-wave infrared (MWIR), and long-wave infrared (LWIR) videos demonstrated the efficacy of the proposed approach even though the training data are very scarce.
Highlights
There are many applications such as traffic monitoring, surveillance, and security monitoring that use optical and infrared videos [1]-[6]
YOLO is used for multiple target tracking and residual network (ResNet) is for target classification
The Center Location Error (CLE) values generally increase whereas the Distance Precision (DP) and EinGT values are relatively stable
Summary
There are many applications such as traffic monitoring, surveillance, and security monitoring that use optical and infrared videos [1]-[6].
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have