Research and implementation of NN based on TensorFlow

B V Shefkin,V O Khomenchuk,K P Storchak,A M Tushych,I V Krasyuk

doi:10.31673/2412-9070.2020.062025

Abstract

TensorFlow is Google’s open-source machine learning and deep learning framework, which is convenient and flexible to build the current mainstream deep learning model. Convolutional neural network is a classical model of deep learning, the advantage lies in its powerful feature extraction capabilities of convolutional blocks. A neural network in the simplest case is a mathematical model consisting of several layers of elements that perform parallel calculations. Initially, such an architecture was created by analogy with the small computing elements of the human brain — neurons. The minimal computing elements of an artificial neural network are also called neurons. Neural networks typically consist of three or more layers: an input layer, a hidden layer (or layers), and an output layer. An important feature of the neural network is its ability to learn by example, this is called learning with a teacher. The neural network is trained on a large number of examples consisting of input-output pairs (corresponding to each other input and output). In object recognition problems, such a pair will be the input image and the corresponding label — the name of the object. Neural network learning is an iterative process that reduces the deviation of the network output from a given «teacher response» — a label that corresponds to a given image. This process consists of steps called epochs of learning (they are usually calculated in thousands), each of which is the adjustment of the «weights» of the neural network — the parameters of the hidden layers of the network. Upon completion of the learning process, the quality of the neural network is usually good enough to perform the task for which it was trained, although the optimal set of parameters that perfectly recognizes all the images, it is often impossible to choose. Based on the TensorFlow platform, a convolutional neural network model with two-convolution-layers was built. The model was trained and tested with the MNIST data set. The test accuracy rate could reach 99,15%, and compared with the rate of 98,69% with only one-convolution-layer model, which shows that the two-convolution-layers convolutional neural network model has a better ability of feature extraction and classification decision-making.

Full Text