Abstract

Fostered by technological and theoretical developments, deep neural networks (DNNs) have achieved great success in many applications, but their training via mini-batch stochastic gradient descent (SGD) can be very costly, owing to the tens of millions of parameters that may need to be optimized and the large number of training examples that must be processed. The computational cost is exacerbated by the inefficiency of the uniform sampling typically used by SGD to form the training mini-batches: since not all training examples are equally relevant for training, sampling them under a uniform distribution is far from optimal, which motivates the study of improved methods to train DNNs. A better strategy is to sample the training instances under a distribution in which the probability of being selected is proportional to the relevance of each individual instance; one way to achieve this is through importance sampling (IS), which minimizes the variance of the gradients w.r.t. the network parameters and consequently improves convergence. In this paper, an IS-based adaptive sampling method to improve the training of DNNs is introduced. This method, dubbed regularized adaptive sampling (RAS), exploits side information to construct the optimal sampling distribution. Experimental comparisons using deep convolutional networks for classification on the MNIST and CIFAR-10 datasets show that, compared against SGD and against another state-of-the-art sampling method, RAS improves the speed and reduces the variance of the training process without incurring significant overhead or affecting classification performance.
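
To illustrate the general idea of importance-sampled mini-batch SGD described above, the following is a minimal sketch, not the paper's RAS method (whose regularization and use of side information are not detailed in the abstract). It assumes a toy logistic-regression model and uses per-example losses as a hypothetical proxy for instance relevance; names such as per_example_loss and the dataset sizes are illustrative only.

```python
# Hypothetical sketch: mini-batch SGD with importance sampling (IS).
# Examples are drawn with probability proportional to a relevance score
# (here, their current loss), and gradients are reweighted by 1/(N * p_i)
# so the mini-batch gradient remains an unbiased estimate of the full one.
import numpy as np

rng = np.random.default_rng(0)

# Toy dataset (all hypothetical): N examples, d features, binary labels.
N, d = 1000, 20
X = rng.normal(size=(N, d))
w_true = rng.normal(size=d)
y = (X @ w_true + 0.1 * rng.normal(size=N) > 0).astype(float)

w = np.zeros(d)                      # logistic-regression parameters
batch_size, lr, eps = 32, 0.1, 1e-3  # eps keeps every probability positive

def per_example_loss(w):
    """Logistic loss of every example; used here as the relevance score."""
    p = 1.0 / (1.0 + np.exp(-(X @ w)))
    return -(y * np.log(p + 1e-12) + (1 - y) * np.log(1 - p + 1e-12))

for step in range(200):
    # 1. Sampling distribution proportional to the relevance scores.
    scores = per_example_loss(w) + eps
    probs = scores / scores.sum()

    # 2. Draw the mini-batch under this non-uniform distribution.
    idx = rng.choice(N, size=batch_size, replace=True, p=probs)

    # 3. Importance weights 1/(N * p_i) correct for the biased sampling.
    iw = 1.0 / (N * probs[idx])

    # Per-example logistic-loss gradients, reweighted and averaged.
    p_batch = 1.0 / (1.0 + np.exp(-(X[idx] @ w)))
    grad = (X[idx] * ((p_batch - y[idx]) * iw)[:, None]).mean(axis=0)

    w -= lr * grad
```

Note that this sketch recomputes all N relevance scores at every step, which would be prohibitive for a DNN; adaptive schemes such as the one the abstract describes avoid this cost by relying on cheaper side information rather than exact per-example losses.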
