Modified Convolutional Neural Network Based on Dropout and the Stochastic Gradient Descent Optimizer

Jing Yang,Guanci Yang

doi:10.3390/a11030028

Abstract

This study proposes a modified convolutional neural network (CNN) algorithm that is based on dropout and the stochastic gradient descent (SGD) optimizer (MCNN-DS), after analyzing the problems of CNNs in extracting the convolution features, to improve the feature recognition rate and reduce the time-cost of CNNs. The MCNN-DS has a quadratic CNN structure and adopts the rectified linear unit as the activation function to avoid the gradient problem and accelerate convergence. To address the overfitting problem, the algorithm uses an SGD optimizer, which is implemented by inserting a dropout layer into the all-connected and output layers, to minimize cross entropy. This study used the datasets MNIST, HCL2000, and EnglishHand as the benchmark data, analyzed the performance of the SGD optimizer under different learning parameters, and found that the proposed algorithm exhibited good recognition performance when the learning rate was set to [0.05, 0.07]. The performances of WCNN, MLP-CNN, SVM-ELM, and MCNN-DS were compared. Statistical results showed the following: (1) For the benchmark MNIST, the MCNN-DS exhibited a high recognition rate of 99.97%, and the time-cost of the proposed algorithm was merely 21.95% of MLP-CNN, and 10.02% of SVM-ELM; (2) Compared with SVM-ELM, the average improvement in the recognition rate of MCNN-DS was 2.35% for the benchmark HCL2000, and the time-cost of MCNN-DS was only 15.41%; (3) For the EnglishHand test set, the lowest recognition rate of the algorithm was 84.93%, the highest recognition rate was 95.29%, and the average recognition rate was 89.77%.

Highlights

The convolutional neural network (CNN) has attracted considerable attention because of its successful application in target detection, image classification, knowledge acquisition, and image semantic segmentation
Modified Convolutional Neural Network Based on Dropout and the stochastic gradient descent (SGD) Optimizer
On the basis of the design of the CNN structure, the Leaky ReLU activation function, and the overfitting prevention method that is based on dropout and the SGD, the Modified Convolutional

Summary

Introduction

The convolutional neural network (CNN) has attracted considerable attention because of its successful application in target detection, image classification, knowledge acquisition, and image semantic segmentation. In [12], two consecutive convolution operations were appended to each layer of the CNN, thereby increasing the recognition rate of image classification by doubling the number of feature extracts. This procedure has high memory requirements from the system. InWhen the regularization the error objective function the considers theoffactors the the number of strategy, learning layers in the CNN increases, capability these that describe the complexity. The time lock decreases, the cumulative error increases To address address these these problems, problems, this this study study designs designs an an improved improved activation activation function function to to increase increase the the convergence rate by adding a dropout layer between the fully connected and output layers.

Typical CNN Model

Traditional

Dropout Layer

Quadratic CNN Structure

Method Based on Dropout and SGD for Preventing Overfitting

Modified Convolutional Neural Network Based on Dropout and the SGD Optimizer

Test Environment

Comparison Algorithm

Datasets and Settings

Results and and Analysis

Boxplot

Comparison and Analysis of the Three Kinds of Algorithms

10. Boxplot

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Algorithms	Publication Date: Mar 7, 2018
Citations: 90	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Modified Convolutional Neural Network Based on Dropout and the Stochastic Gradient Descent Optimizer

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithms

Lead the way for us

Similar Papers

The Investigation of Multiple Optimization Methods on Convolutional Neural Network
Hetian Pan
Highlights in Science, Engineering and Technology | VOL. 85
Hetian PanHetian Pan
13 Mar 2024
Highlights in Science, Engineering and Technology | VOL. 85

A Generalized Deep Learning Model for Denoising Image Datasets
Kurian Thomas ... Supriya M H
International Journal of Engineering and Advanced Technology | VOL. 10
Kurian Thomas, et. al.Kurian Thomas ... Supriya M H
30 Oct 2020
International Journal of Engineering and Advanced Technology | VOL. 10

A modified time adaptive self-organizing map with stochastic gradient descent optimizer for automated food recognition system
Jameer Gulab Kotwal ... Vinod Kimbahune
Journal of Stored Products Research | VOL. 107
Jameer Gulab Kotwal, et. al.Jameer Gulab Kotwal ... Vinod Kimbahune
23 Apr 2024
Journal of Stored Products Research | VOL. 107

Vision-Based Race Track SLAM Based Only on Lane Curvature
Jongsang Suh ... Eric Yongkeun Choi
IEEE Transactions on Vehicular Technology | VOL. 69
Jongsang Suh, et. al.Jongsang Suh ... Eric Yongkeun Choi
10 Jan 2020
IEEE Transactions on Vehicular Technology | VOL. 69

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Modified Convolutional Neural Network Based on Dropout and the Stochastic Gradient Descent Optimizer

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithms