CNN Variants for Computer Vision: History, Architecture, Application, Challenges and Future Scope

Dulari Bhatt,Jigar Patel,Chirag Patel,Kirit Modi,Rasmika Vaghela,Hemant Ghayvat,Hardik Talsania,Sharnil Pandya

doi:10.3390/electronics10202470

Dulari Bhatt, Jigar Patel + Show 6 more

Open Access

PDF Available

https://doi.org/10.3390/electronics10202470

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Computer vision is becoming an increasingly trendy word in the area of image processing. With the emergence of computer vision applications, there is a significant demand to recognize objects automatically. Deep CNN (convolution neural network) has benefited the computer vision community by producing excellent results in video processing, object recognition, picture classification and segmentation, natural language processing, speech recognition, and many other fields. Furthermore, the introduction of large amounts of data and readily available hardware has opened new avenues for CNN study. Several inspirational concepts for the progress of CNN have been investigated, including alternative activation functions, regularization, parameter optimization, and architectural advances. Furthermore, achieving innovations in architecture results in a tremendous enhancement in the capacity of the deep CNN. Significant emphasis has been given to leveraging channel and spatial information, with a depth of architecture and information processing via multi-path. This survey paper focuses mainly on the primary taxonomy and newly released deep CNN architectures, and it divides numerous recent developments in CNN architectures into eight groups. Spatial exploitation, multi-path, depth, breadth, dimension, channel boosting, feature-map exploitation, and attention-based CNN are the eight categories. The main contribution of this manuscript is in comparing various architectural evolutions in CNN by its architectural change, strengths, and weaknesses. Besides, it also includes an explanation of the CNN’s components, the strengths and weaknesses of various CNN variants, research gap or open challenges, CNN applications, and the future research direction.

Highlights

Artificial intelligence is bridging the gap between machine and human talents at a breakneck pace
A convolutional neural network is a method of deep learning that takes an input image and assigns importance to various objects in the image, distinguishing one from the other [1]
In 2015, Ren proposed a rapid region-based CNN (R-CNN) for object detection as an enhancement over R-CNN, which is employed in their research for feature extraction and is able to recognize the boundary and score of objects at various positions concurrently

Summary

Introduction

Artificial intelligence is bridging the gap between machine and human talents at a breakneck pace. It represents a wide range of CNN architectures, from their conception to their most current advancements and achievements This survey will assist readers in developing architectural novelties in convolutional neural networks by providing a more profound theoretical knowledge of CNN design concepts. This survey will assist readers in learning more about CNN and CNN variants, which helps to improve the field The contributions of this survey paper are summarized below: This is the first review that almost provides each detail about CNN for computer vision, its history, CNN architecture designs, its merits and demerits, application, and the future work to take upon in a single paper. It provides a clear listing of future research trends in the area of CNN for computer vision The organization of this survey paper includes the first section, which presents a methodical comprehension of CNN.

CNN Fundamentals

Pooling Layer

Fully Connected Layer

Spatial Exploitation-Based CNNs

CNN Based on Depth

CNNs with Multiple Paths

Feature-Map Exploitation Based CNNs

Multi-Connection Depending on the Width

CNNs That Are Based on Attention

Dimension-Based CNN

Object Detection and Segmentation

Image Classification

Recognition of Speech

Video Processing

Images with Low Resolution

System with Limited Resources

CNN for Various Dimensional Data

3.10. Object Counting

CNN Challenges

Future Directions

Conclusions

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronics	Publication Date: Oct 11, 2021
Citations: 386	License type: CC BY 4.0

R Discovery Prime

CNN Variants for Computer Vision: History, Architecture, Application, Challenges and Future Scope

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

A survey of the recent architectures of deep convolutional neural networks
Asifullah Khan ... Anabia Sohail
Artificial Intelligence Review | VOL. 53
Asifullah Khan, et. al.Asifullah Khan ... Anabia Sohail
21 Apr 2020
Artificial Intelligence Review | VOL. 53

Texture Patterns for Object Recognition and Content-Based Color Image Retrieval

-

21 Dec 2020
21 Dec 2020

Convolutional Neural Network and Its Advances: Overview and Applications
Jyoti S Raghatwan ... Sandhya Arora
-
Jyoti S Raghatwan, et. al.Jyoti S Raghatwan ... Sandhya Arora
20 Sep 2021
20 Sep 2021

Receptive Field Regularization Techniques for Audio Classification and Tagging With Deep Convolutional Neural Networks
Khaled Koutini ... Hamid Eghbal-Zadeh
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29
Khaled Koutini, et. al.Khaled Koutini ... Hamid Eghbal-Zadeh
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

CNN Variants for Computer Vision: History, Architecture, Application, Challenges and Future Scope

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Electronics