Abstract
The most critical concern in machine learning is how to design an algorithm that performs well on both training data and new data. The no-free-lunch theorem implies that each task needs a machine learning algorithm tailored to it. A set of strategies and preferences is therefore built into learning machines to tune them for the problem at hand. These strategies and preferences, whose core aim is improved generalization, are collectively known as regularization. In deep learning, because of the large number of parameters, many regularization methods are available to the community, and developing more effective regularization strategies has been the subject of significant research effort in recent years. However, it is difficult for practitioners to choose the most suitable strategy for the problem at hand, because there is no comparative study of the performance of different strategies. In this paper, as a first step, the most effective regularization methods and their variants are presented and analyzed systematically. As a second step, a comparative study of regularization techniques is presented in which test errors and computational costs are evaluated for a convolutional neural network on the CIFAR-10 dataset ( https://www.cs.toronto.edu/~kriz/cifar.html ). Finally, the regularization methods are compared in terms of network accuracy, the number of training epochs required, and the number of operations per input sample, and the results are discussed and interpreted with respect to the strategy each method employs. The experimental results show that weight decay and data augmentation incur little computational overhead and can therefore be used in most applications. When sufficient computational resources are available, methods from the dropout family are a reasonable choice. When computational resources are abundant, batch normalization variants and ensemble methods are also reasonable strategies.
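To make the compared techniques concrete, the sketch below shows where each family of regularizers typically plugs into a CIFAR-10 CNN training loop in PyTorch. It is illustrative only: the abstract does not specify the paper's architecture, optimizer, or hyperparameters, so the model, the weight-decay coefficient, the dropout rate, and the augmentation transforms here are assumptions.

```python
# Illustrative sketch, NOT the paper's experimental setup: architecture and
# hyperparameters below are assumed for demonstration purposes only.
import torch
import torch.nn as nn
import torchvision
import torchvision.transforms as T

# Data augmentation: random crops and horizontal flips on CIFAR-10 training images.
train_tf = T.Compose([
    T.RandomCrop(32, padding=4),
    T.RandomHorizontalFlip(),
    T.ToTensor(),
])
train_set = torchvision.datasets.CIFAR10(root="./data", train=True,
                                         download=True, transform=train_tf)
train_loader = torch.utils.data.DataLoader(train_set, batch_size=128, shuffle=True)

# Small CNN with batch normalization and dropout as explicit regularization layers.
model = nn.Sequential(
    nn.Conv2d(3, 32, 3, padding=1), nn.BatchNorm2d(32), nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Conv2d(32, 64, 3, padding=1), nn.BatchNorm2d(64), nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Flatten(),
    nn.Dropout(p=0.5),                      # dropout before the classifier head
    nn.Linear(64 * 8 * 8, 10),
)

# Weight decay (L2 regularization) is applied through the optimizer.
optimizer = torch.optim.SGD(model.parameters(), lr=0.01,
                            momentum=0.9, weight_decay=5e-4)
criterion = nn.CrossEntropyLoss()

# One training step per batch; test error would be tracked on the CIFAR-10 test split.
model.train()
for images, labels in train_loader:
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
    break  # single step shown for brevity
```

Note how the three families differ in where they act: data augmentation modifies the input pipeline, weight decay is an optimizer setting, and dropout and batch normalization are layers inside the network, which is one reason their computational costs differ.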