Convolutional neural networks for solving computer vision problems

State University Of Telecommunications ,O V Zinchenko

doi:10.31673/2412-4338.2022.020411

Abstract

This article provides an overview of the main methods of solving computer vision problems of classification, segmentation and image processing, which are implemented in CV systems. Computer vision systems are programmed to perform highly specialized tasks, capable of detecting objects during identification, reading serial numbers, and searching for surface defects. When applying deep learning methods in CV systems, their processing speed on large data sets and the accuracy of image classification/segmentation are significantly increased. Artificial vision systems are able to identify individual pixels according to the relevant features during processing, provide a high-quality result in pattern recognition, image restoration, and fitting part of the image. Although some computer vision algorithms were developed to simulate visual perception, a larger number of proposed methods are able to fully process images and determine their characteristic properties. The scope of application of CV systems will continue to expand, as the need for artificial intelligence systems is growing rapidly. The purpose of this article is to provide a structured review of computer vision technologies based on their advantages and disadvantages. The work summarizes the types of CV-systems with artificial intelligence according to the spectrum of their applications, highlights the main problematic areas of their research, such as recognition, identification and detection. The article reviews convolutional neural networks (CNNs), which are successfully applied to the analysis of visual images in deep learning. CNN architectures in some cases outperform artificial neural networks in classification tasks by their performance. Currently, convolutional neural networks are the main tool for classification and recognition of objects, faces in photographs, recognition of video and audio materials. This paper provides a comparative analysis of well-known CNN models: LeNet 5, AlexNet, VGGNet, GoogLeNet, ResNet and their effectiveness in CV systems. Approaches to the modeling of architectures of convolutional neural networks are proposed, which will allow, in the future, to solve the problem of classification in tasks for computer vision, thereby increasing their performance, accuracy and quality of processing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Convolutional neural networks for solving computer vision problems

Abstract

Talk to us

Similar Papers

More From: Telecommunication and Information Technologies

Lead the way for us

Similar Papers

Analysis of CNN Architectures for Human Action Recognition in Video
David Silva ... Luis Gonzalez-Gurrola
Computación y Sistemas | VOL. 26
David Silva, et. al.David Silva ... Luis Gonzalez-Gurrola
30 Jun 2022
Computación y Sistemas | VOL. 26

Texture Patterns for Object Recognition and Content-Based Color Image Retrieval

-

21 Dec 2020
21 Dec 2020

CNN Variants for Computer Vision: History, Architecture, Application, Challenges and Future Scope
Dulari Bhatt ... Sharnil Pandya
Electronics | VOL. 10
Dulari Bhatt, et. al.Dulari Bhatt ... Sharnil Pandya
11 Oct 2021
Electronics | VOL. 10

Receptive Field Regularization Techniques for Audio Classification and Tagging With Deep Convolutional Neural Networks
Khaled Koutini ... Gerhard Widmer
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29
Khaled Koutini, et. al.Khaled Koutini ... Gerhard Widmer
01 Jan 2020
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Convolutional neural networks for solving computer vision problems

Abstract

Talk to us

Similar Papers

More From: Telecommunication and Information Technologies