Abstract

Nighttime pedestrian detection using visible images alone suffers from high miss rates due to poor illumination. Cross-modality fusion can alleviate this problem, as infrared and visible images provide complementary information to each other. In this paper, we propose a cross-modal fusion framework based on YOLOv5 that addresses the challenges of nighttime pedestrian detection under low-light conditions. The framework employs a dual-stream architecture that processes visible and infrared images separately. Through a Cross-Modal Feature Rectification Module (CMFRM), visible and infrared features are rectified at a granular level, leveraging their spatial correlations to focus on complementary information and substantially reduce the uncertainty and noise introduced by each modality. In addition, we introduce a two-stage Feature Fusion Module (FFM): the first stage applies a cross-attention mechanism for cross-modal global reasoning, and the second stage uses a mixed channel embedding to produce the enhanced fused features. Moreover, our method involves multi-dimensional interaction, rectifying feature maps along the channel and spatial dimensions and applying cross-attention at the sequence level, which is critical for effectively generalizing cross-modal feature combinations. Overall, our approach significantly improves the accuracy and robustness of nighttime pedestrian detection, offering new perspectives and technical pathways for visual information processing in low-light environments.
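To make the described pipeline concrete, the following is a minimal PyTorch sketch of the two components named above: a rectification module that corrects each modality's feature map along channel and spatial dimensions using the other modality, and a two-stage fusion module with cross-attention followed by a mixed channel embedding. The module interfaces, attention formulation, and dimensions here are illustrative assumptions, not the authors' released code, and the detector backbone (YOLOv5) is omitted.

```python
# Illustrative sketch of the cross-modal rectification and fusion idea described above.
# The exact formulation in the paper may differ; the channel/spatial attention design,
# layer sizes, and residual connections below are assumptions for demonstration only.
import torch
import torch.nn as nn


class CMFRM(nn.Module):
    """Cross-Modal Feature Rectification Module (sketch): each modality's feature map
    is rectified along channel and spatial dimensions using cues from the other."""

    def __init__(self, channels, reduction=4):
        super().__init__()
        # Channel weights predicted from globally pooled features of both modalities.
        self.channel_mlp = nn.Sequential(
            nn.Linear(2 * channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, 2 * channels),
            nn.Sigmoid(),
        )
        # Spatial weights predicted from the concatenated feature maps.
        self.spatial_conv = nn.Sequential(
            nn.Conv2d(2 * channels, 2, kernel_size=7, padding=3),
            nn.Sigmoid(),
        )

    def forward(self, rgb, ir):
        b, c, _, _ = rgb.shape
        pooled = torch.cat([rgb.mean(dim=(2, 3)), ir.mean(dim=(2, 3))], dim=1)  # (B, 2C)
        ch_w = self.channel_mlp(pooled).view(b, 2, c, 1, 1)
        sp_w = self.spatial_conv(torch.cat([rgb, ir], dim=1))  # (B, 2, H, W)
        # Each modality is corrected by weights derived from both, emphasizing
        # the complementary information carried by the other stream.
        rgb_out = rgb + ir * ch_w[:, 0] * sp_w[:, 0:1]
        ir_out = ir + rgb * ch_w[:, 1] * sp_w[:, 1:2]
        return rgb_out, ir_out


class FFM(nn.Module):
    """Two-stage Feature Fusion Module (sketch): stage 1 applies cross-attention
    between the two token sequences; stage 2 mixes channels into one fused output."""

    def __init__(self, channels, heads=4):
        super().__init__()
        self.attn_rgb = nn.MultiheadAttention(channels, heads, batch_first=True)
        self.attn_ir = nn.MultiheadAttention(channels, heads, batch_first=True)
        # Mixed channel embedding: a 1x1 convolution over the concatenated features.
        self.mix = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )

    def forward(self, rgb, ir):
        b, c, h, w = rgb.shape
        rgb_seq = rgb.flatten(2).transpose(1, 2)  # (B, HW, C) token sequence
        ir_seq = ir.flatten(2).transpose(1, 2)
        # Stage 1: cross-modal global reasoning via cross-attention
        # (queries from one modality, keys/values from the other).
        rgb_attn, _ = self.attn_rgb(rgb_seq, ir_seq, ir_seq)
        ir_attn, _ = self.attn_ir(ir_seq, rgb_seq, rgb_seq)
        rgb_f = (rgb_seq + rgb_attn).transpose(1, 2).view(b, c, h, w)
        ir_f = (ir_seq + ir_attn).transpose(1, 2).view(b, c, h, w)
        # Stage 2: mixed channel embedding produces the fused feature for the detector head.
        return self.mix(torch.cat([rgb_f, ir_f], dim=1))


if __name__ == "__main__":
    rgb = torch.randn(2, 64, 40, 40)   # visible-branch feature map
    ir = torch.randn(2, 64, 40, 40)    # infrared-branch feature map
    rgb_r, ir_r = CMFRM(64)(rgb, ir)
    fused = FFM(64)(rgb_r, ir_r)
    print(fused.shape)  # torch.Size([2, 64, 40, 40])
```

In a dual-stream detector, modules like these would typically be applied at one or more backbone stages, with the fused feature map replacing the single-modality feature passed to the detection neck and head.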
