Vehicle detection and tracking from a UAV perspective often encounters omission and misdetection due to the small targets, complex scenes and target occlusion, which finally influences hugely on detection accuracy and target tracking stability. Additionally, the number of parameters of current model is large that makes it is hard to be deployed on mobile devices. Therefore, this paper proposes a YOLO-LMP and NGCTrack-based target detection and tracking algorithm to address these issues. Firstly, the performance of detecting small targets in occluded scenes is enhanced by adding a MODConv to the small-target detection head and increasing its size; In addition, excessive deletion of prediction boxes is prevented by utilizing LSKAttention mechanism to adaptively adjust the target sensing field at the downsampling stage and combining it with the Soft-NMS strategy. Furthermore, the C2f module is replaced by the FPW to reduce the pointless computation and memory utilization of the model. At the target tracking stage, the so-called NGCTrack in our algorithm replaces IOU with GIOU and employs a modified NSA Kalman filter to adjust the state-space aspect ratio for width prediction. Finally, the camera adjustment mechanism was introduced to improve the precision and consistency of tracking. The experimental results show that, compared to YOLOv8, the YOLO-LMP model improves map50 and map50:95 metrics by 10.3 and 12.2%, respectively and the number of parameters is decreased by 47.7%. After combined it with the improved NGCTrack, the number of IDSW reduced by 73.6% compared to the ByteTrack method, while the MOTA and IDF1 increase by 5.2 and 9.8%, respectively.
Read full abstract