Attention Based Multi-Layer Fusion of Multispectral Images for Pedestrian Detection

Yongtao Zhang, Song Huang, Linzhen Nie, Zhishuai Yin

doi:10.1109/access.2020.3022623

Open Access

Journal: IEEE Access
Publication Date: Jan 1, 2020
Citations: 35
License type: CC BY 4.0

Affiliation: Wuhan University of Technology

Abstract

Multispectral images are increasingly used for pedestrian detection. Preliminary fusion strategies fail to exploit informative features from cross-spectral images or, worse, may introduce additional interference. In this paper, we propose an attention-based multi-layer fusion network built on a triple-stream deep convolutional neural network architecture for multispectral pedestrian detection. The effectiveness of multi-layer fusion is examined and verified in this work. Furthermore, a channel-wise attention module (CAM) and a spatial-wise attention module (SAM) are developed and incorporated into the network to adjust the weights of multispectral features more finely along the channel and spatial dimensions, respectively. Channel-wise attention is trained with self-supervision, while spatial-wise attention is trained with external supervision, as we remodel its learning process as saliency detection. The two attention-based weighting mechanisms are evaluated separately and then sequentially. Experimental results on the KAIST dataset show that the proposed multi-layer cross-spectral fusion R-CNN (CS-RCNN), with spatial-wise weighting applied alone, achieves state-of-the-art performance on all-day detection while outperforming the compared methods at nighttime.
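
The abstract does not give the internal design of CAM or SAM, so the following is only a minimal NumPy sketch of what channel-wise and spatial-wise weighting of fused features can look like in general. Everything in it is an assumption made for illustration: the squeeze-and-excitation style channel weighting, the 1x1 projection used for the spatial mask, the channel-then-spatial order, the function names, the tensor shapes, and the random parameters standing in for trained weights. It shows a single fusion layer rather than the multi-layer, triple-stream network, and it does not model the self-supervised or saliency-supervised training described above.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat, w1, w2):
    """Assumed squeeze-and-excitation style weighting, one scalar per channel.

    feat: (C, H, W) feature map; w1: (C_hidden, C); w2: (C, C_hidden).
    """
    squeezed = feat.mean(axis=(1, 2))        # global average pool -> (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)  # bottleneck with ReLU
    weights = sigmoid(w2 @ hidden)           # per-channel weights in (0, 1)
    return feat * weights[:, None, None], weights

def spatial_attention(feat, kernel):
    """Assumed 1x1 projection across channels giving one weight per pixel.

    feat: (C, H, W); kernel: (C,).
    """
    mask = sigmoid(np.tensordot(kernel, feat, axes=([0], [0])))  # (H, W)
    return feat * mask[None, :, :], mask

def fuse(visible, thermal, w1, w2, kernel):
    """Concatenate the two spectral feature maps, then reweight them."""
    fused = np.concatenate([visible, thermal], axis=0)  # (2C, H, W)
    fused, channel_weights = channel_attention(fused, w1, w2)
    fused, spatial_mask = spatial_attention(fused, kernel)
    return fused, channel_weights, spatial_mask

# Hypothetical shapes and random parameters, for illustration only.
rng = np.random.default_rng(0)
C, H, W = 4, 8, 8
visible = rng.standard_normal((C, H, W))
thermal = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((2, 2 * C))
w2 = rng.standard_normal((2 * C, 2))
kernel = rng.standard_normal(2 * C)

fused, channel_weights, spatial_mask = fuse(visible, thermal, w1, w2, kernel)
print(fused.shape, channel_weights.shape, spatial_mask.shape)
```

In this sketch the channel weights rescale whole feature maps, for example to favor thermal channels at night, while the spatial mask rescales individual locations, which is the role a saliency-like map could play. Dropping the `channel_attention` call gives a spatial-only variant, loosely analogous to the configuration the abstract reports as performing best.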
