Three‐stage RGBD architecture for vehicle and pedestrian detection using convolutional neural networks and stereo vision

Pedro Augusto Pinho Ferraz,Carlos Augusto Paiva Da Silva Martins,Flávia Magalhães Freitas Ferreira,Bernardo Augusto Godinho Oliveira

doi:10.1049/iet-its.2019.0367

Abstract

With the growth of autonomous vehicles and collision-avoidance systems, several approaches using deep learning and convolutional neural networks (CNNs) continually address accuracy improvement in obstacle detection. The authors introduce a three-stage architecture that adds side channels as low-level features to serve as input to existing CNNs. In a case study, the architecture is used to extract depth from stereo cameras, and then compose RGBD inputs to state-of-the-art CNNs to improve their vehicle and pedestrian detection accuracy. This can be achieved by simple modifications on the first layers of any existing CNN with RGB inputs. To validate the architecture, the state-of-the-art matching cost-CNN, and cascade residual learning, both specialist algorithms to extract depth information combined to the state-of-the-art Faster-region-based CNN, MSCNCN, and Subcategory-aware Convolutional Neural Network (SubCNN) to yield the models to be tested using the KITTI dataset benchmark. In many cases, the accuracy (in terms of average precision) using their proposal outperforms the original scores in various scenarios of detection difficulty, reaching improvements up to +3.96% in the training and +1.50% in the testing KITTI datasets. This proposal also introduces efficient methods to initialise the weights of the depth convolutional filters during transfer learning using net surgery.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Three‐stage RGBD architecture for vehicle and pedestrian detection using convolutional neural networks and stereo vision

Abstract

Talk to us

Similar Papers

More From: IET Intelligent Transport Systems

Lead the way for us

Journal: IET Intelligent Transport Systems	Publication Date: Sep 2, 2020
Citations: 12

Similar Papers

DSP-Based Traffic Target Detection for Intelligent Transportation
Jianhua Zhang ... Ruyu Liu
IEEE Transactions on Intelligent Transportation Systems | VOL. 24
Jianhua Zhang, et. al.Jianhua Zhang ... Ruyu Liu
01 Nov 2023
IEEE Transactions on Intelligent Transportation Systems | VOL. 24

Research on improved convolutional wavelet neural network
Jingwei Liu ... Jiaxin Li
Scientific Reports | VOL. 11
Jingwei Liu, et. al.Jingwei Liu ... Jiaxin Li
09 Sep 2021
Scientific Reports | VOL. 11

PVformer: Pedestrian and Vehicle Detection Algorithm Based on Swin Transformer in Rainy Scenes
Zaiming Sun ... Guangda Xie
Sensors | VOL. 22
Zaiming Sun, et. al.Zaiming Sun ... Guangda Xie
28 Jul 2022
Sensors | VOL. 22

Pseudo-labeling of transfer learning convolutional neural network data for human facial emotion recognition
Olena О Arsirii ... Denys V Petrosiuk
Herald of Advanced Information Technology | VOL. 6
Olena О Arsirii, et. al.Olena О Arsirii ... Denys V Petrosiuk
12 Oct 2023
Herald of Advanced Information Technology | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Three‐stage RGBD architecture for vehicle and pedestrian detection using convolutional neural networks and stereo vision

Abstract

Talk to us

Similar Papers

More From: IET Intelligent Transport Systems