Abstract

Multimodal image pairs (e.g., visible and thermal images) provide mutually complementary pixel information and can enhance the robustness and reliability of object detection in applications such as autonomous driving and video surveillance. To exploit the effective information of both modalities, this paper proposes a multimodal feature fusion network based on YOLOv5. A multimodal feature fusion adaptive weighting module is designed to perform feature extraction and fusion at three scales in the network, making the best use of multimodal features. Experiments on two public datasets show that our multimodal object detection network (MFF-YOLOv5) outperforms current state-of-the-art (SOTA) methods.
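As a rough illustration of the adaptive weighted fusion idea described above, the sketch below fuses same-scale visible and thermal feature maps with learned per-modality weights. The weighting mechanism (global pooling followed by a 1x1 convolution and softmax) and the class name `AdaptiveFusion` are assumptions for illustration, not the paper's exact module.

```python
import torch
import torch.nn as nn


class AdaptiveFusion(nn.Module):
    """Illustrative sketch of adaptive weighted fusion of two modalities.

    Assumption: weights are derived from globally pooled, concatenated
    features; the paper's actual module may differ.
    """

    def __init__(self, channels: int):
        super().__init__()
        # Predict one scalar weight per modality from the pooled features.
        self.weight_net = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(2 * channels, 2, kernel_size=1),
        )

    def forward(self, rgb_feat: torch.Tensor, thermal_feat: torch.Tensor) -> torch.Tensor:
        # Concatenate along channels, then derive per-modality weights.
        stacked = torch.cat([rgb_feat, thermal_feat], dim=1)
        weights = torch.softmax(self.weight_net(stacked), dim=1)  # shape (N, 2, 1, 1)
        # Weighted sum keeps the channel count of a single modality,
        # so the fused map can feed the detection head unchanged.
        return weights[:, 0:1] * rgb_feat + weights[:, 1:2] * thermal_feat


# Applied independently at each of the three detection scales.
if __name__ == "__main__":
    fuse = AdaptiveFusion(channels=256)
    rgb = torch.randn(1, 256, 80, 80)
    thermal = torch.randn(1, 256, 80, 80)
    print(fuse(rgb, thermal).shape)  # torch.Size([1, 256, 80, 80])
```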
