Abstract

This paper considers a model of object recognition in images using convolutional neural networks; the efficiency of the model-based process involving the training of deep layers in convolutional neural networks has been studied. There are objective difficulties associated with determining the optimal characteristics of neural networks, so there is an issue related to retraining a neural network. Eliminating the retraining by determining only the optimal number of epochs is insufficient since it does not provide high accuracy. The requirements for the set of images for model training and verification have been defined. These requirements are better met by the INRIA image set (France). GoogLeNet (USA) has been established to be a trained model that can perform object recognition on images but the object recognition reliability is insufficient. Therefore, it becomes necessary to improve the effectiveness of object recognition in images. It is advisable to use the GoogLeNet architecture to build a specialized model that, by changing the parameters and retraining some layers, could allow for better recognition of objects in images. Ten models were trained using the following parameters: learning speed, the number of epochs, an optimization algorithm, the type of learning speed change, a gamma or power coefficient, a pre-trained model. A convolutional neural network has been developed to improve the precision and efficiency of object recognition in images. The optimal neural network training parameters were determined: training speed, 0.000025; the number of epochs, 100; a power coefficient, 0.25, etc. A 3 % increase in precision was obtained, which makes it possible to assert the proper choice of the architecture for the developed network and the selection of its parameters. That allows this network to be used for practical tasks of object recognition in images.

Highlights

  • Image processing is extremely important in modern science and practice, so it is constantly evolving and improving

  • The results of our study show that the Inria-10 trained model demonstrates the high accuracy of object recognition in images (Fig. 11)

  • We have investigated the models of Inria-1, Inria-2, Inria-3, Inria-4, Inria-5, Inria-6, Inria-7, Inria-8, Inria-9 neural networks based on the INRIA set

Read more

Summary

Introduction

Image processing is extremely important in modern science and practice, so it is constantly evolving and improving. A modern relevant industrial area is the development of precision agriculture, which is based on the results from agricultural monitoring These data, acquired from UAV video cameras, make it possible to assess the harvested crop, control the routes of movement of agricultural machinery, predict yields, etc. In this case, an important criterion is the UAV’s ability to avoid collisions with close objects, determine the position in space, direction, and trajectory of the flight by receiving input data on the recognized objects. The effectiveness of these systems is determined by the precision of object recognition whose evaluation requires experimental research

Literature review and problem statement
The aim and objectives of the study
The study materials and methods
Conclusions
Findings
15. Deep Learning
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.