MOD-YOLO: Multispectral object detection based on transformer dual-stream YOLO

Yanhua Shao, Qimeng Huang, Yanying Mei, Hongyu Chu

doi:10.1016/j.patrec.2024.05.001

Abstract

Multispectral object detection improves detection precision in low-visibility scenes, which makes object detection applications more reliable and stable in open environments. The Cross-Modality Fusion Transformer (CFT) fuses information from different spectra effectively, but it relies on large models and expensive computing resources. In this paper, we propose a multispectral object detection dual-stream YOLO (MOD-YOLO), based on a Cross Stage Partial CFT (CSP-CFT), to address the heavy inference computation that prior studies incur through the repeated fusion of multispectral features. The network divides the fused feature map into two parts: one is passed on as the cross-stage output and the other is combined with the next-stage feature, which yields a better balance of speed, memory, and precision. To further improve accuracy, SIoU is used as the loss function. Extensive experiments on multiple publicly available datasets show that our model, which has the smallest model size while maintaining excellent performance, offers a better tradeoff between accuracy and model size than other popular models.
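The abstract names SIoU as the loss function but does not state its form. The following is a minimal sketch, assuming the commonly cited SIoU formulation in which an angle-aware distance cost and a shape cost are added to 1 - IoU. The function name `siou_loss`, the `(x1, y1, x2, y2)` box format, and the default `theta=4.0` are illustrative assumptions and are not taken from the paper; the authors' implementation may differ.

```python
import math


def siou_loss(pred, target, theta=4.0, eps=1e-9):
    """SIoU-style loss for two axis-aligned boxes given as (x1, y1, x2, y2).

    Hypothetical helper for illustration; not the paper's code.
    """
    px1, py1, px2, py2 = pred
    tx1, ty1, tx2, ty2 = target
    pw, ph = px2 - px1, py2 - py1
    tw, th = tx2 - tx1, ty2 - ty1

    # Plain IoU of the two boxes.
    iw = max(0.0, min(px2, tx2) - max(px1, tx1))
    ih = max(0.0, min(py2, ty2) - max(py1, ty1))
    inter = iw * ih
    iou = inter / (pw * ph + tw * th - inter + eps)

    # Width and height of the smallest box enclosing both boxes.
    cw = max(px2, tx2) - min(px1, tx1)
    ch = max(py2, ty2) - min(py1, ty1)

    # Offset between the two box centers.
    dx = (tx1 + tx2 - px1 - px2) * 0.5
    dy = (ty1 + ty2 - py1 - py2) * 0.5
    sigma = math.hypot(dx, dy)

    # Angle cost: 0 when the centers are axis-aligned, 1 at 45 degrees.
    if sigma < eps:
        angle_cost = 0.0
    else:
        sin_a = min(1.0, abs(dy) / sigma)
        sin_b = min(1.0, abs(dx) / sigma)
        sin_alpha = sin_b if sin_a > math.sqrt(2.0) / 2.0 else sin_a
        angle_cost = math.sin(2.0 * math.asin(sin_alpha))

    # Distance cost, weighted by the angle cost through gamma.
    gamma = 2.0 - angle_cost
    rho_x = (dx / (cw + eps)) ** 2
    rho_y = (dy / (ch + eps)) ** 2
    distance_cost = (1.0 - math.exp(-gamma * rho_x)) + (1.0 - math.exp(-gamma * rho_y))

    # Shape cost: penalizes mismatch in width and height.
    omega_w = abs(pw - tw) / (max(pw, tw) + eps)
    omega_h = abs(ph - th) / (max(ph, th) + eps)
    shape_cost = (1.0 - math.exp(-omega_w)) ** theta + (1.0 - math.exp(-omega_h)) ** theta

    return 1.0 - iou + 0.5 * (distance_cost + shape_cost)
```

Under these assumptions, identical boxes give a loss close to zero, and the loss grows as the predicted box moves away from the target or its shape diverges, for example when comparing `siou_loss((0, 0, 2, 2), (1, 0, 3, 2))` with `siou_loss((0, 0, 1, 1), (2, 2, 3, 3))`.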

Journal: Pattern Recognition Letters
Publication Date: May 3, 2024
Citations: 2

Similar Papers

Multispectral Object Detection for Autonomous Vehicles
Karasawa Takumi ... Yoshitaka Ushiku
23 Oct 2017

Multi-Spatial Pyramid Feature and Optimizing Focal Loss Function for Object Detection
Sheng-Ye Wang ... Le-Yuan Gao
IEEE Transactions on Intelligent Vehicles | VOL. 9
01 Jan 2024

Comparative study on object detection in visual scenes using deep learning
Kapil Kumar ... Kamal Kant Verma
World Journal of Advanced Engineering Technology and Sciences | VOL. 10
30 Nov 2023

Assessing thermal imagery integration into object detection methods on air-based collection platforms
James E Gallagher ... Edward J Oughton
Scientific Reports | VOL. 13
25 May 2023
