Multimodal pedestrian detection using metaheuristics with deep convolutional neural network in crowded scenes

Deepak Kumar Jain,Xudong Zhao,Germán González-Almagro,Chenquan Gan,Ketan Kotecha

doi:10.1016/j.inffus.2023.02.014

Abstract

Pedestrian detection (PD) is a vital computer vision (CV) problem that is highly employed in several real-time applications, namely autonomous driving methods, robotics, and security observing methods. Simulated by deep learning (DL) approaches to the recognition of generic objects, several investigation mechanisms have attained maximum recognition accuracy for acceptable scale and non-blocked pedestrians. However, the detection efficiency needed to be improved for complex cases like rare pose samples, crowd scenes, and cases with worse visibility due to daytime or weather. Therefore, this study develops a multimodal pedestrian detection system in crowded scenes using metaheuristics and a deep convolutional neural network (MMPD-MDCNN) technique. The MMPD-MDCNN technique’s goal is to identify pedestrians in crowd scenes using different deep-learning models effectively. The proposed MMPD-MDCNN technique integrates three deep learning models: the residual network (ResNet-50), Inception v3, and the capsule network (CapsNet). In addition, the Harris Hawks Optimization (HHO) algorithm is applied for optimal hyperparameter tuning of the deep learning models. For pedestrian detection, the MMPD-MDCNN technique uses the long short-term memory (LSTM) model, and its hyperparameters can be adjusted by the shark smell optimization (SSO) algorithm. To demonstrate the superior performance of the MMPD-MDCNN approach, A comprehensive set of simulations on the INRIA and UCSD datasets was performed to illustrate the superior performance of the MMPD-MDCNN approach. The experimental results suggest that the MMPD-MDCNN model performs well on both datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multimodal pedestrian detection using metaheuristics with deep convolutional neural network in crowded scenes

Abstract

Talk to us

Similar Papers

More From: Information Fusion

Lead the way for us

Journal: Information Fusion	Publication Date: Feb 14, 2023
Citations: 21

Similar Papers

Deep learning‐based robust medical image watermarking exploiting DCT and Harris hawks optimization
Anusha Chacko ... Shanty Chacko
International Journal of Intelligent Systems | VOL. 37
Anusha Chacko, et. al.Anusha Chacko ... Shanty Chacko
22 Nov 2021
International Journal of Intelligent Systems | VOL. 37

Convolutional Nonlinear Differential Recurrent Neural Networks for Crowd Scene Understanding
Naifan Zhuang ... Kien A Hua
International Journal of Semantic Computing | VOL. 12
Naifan Zhuang, et. al.Naifan Zhuang ... Kien A Hua
01 Dec 2018
International Journal of Semantic Computing | VOL. 12

Early Detection of Covid-19 Disease using Computed Tomography Images and Optimized CNN-LSTM
Muhammad Hammad Memon ... Jianping Li
-
Muhammad Hammad Memon, et. al.Muhammad Hammad Memon ... Jianping Li
18 Dec 2020
18 Dec 2020

Refining the Efficiency of R-CNN in Pedestrian Detection
Katleho L Masita ... Ali N Hasan
-
Katleho L Masita, et. al.Katleho L Masita ... Ali N Hasan
10 Sep 2021
10 Sep 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multimodal pedestrian detection using metaheuristics with deep convolutional neural network in crowded scenes

Abstract

Talk to us

Similar Papers

More From: Information Fusion