AdaGCR: An improved method for optimizing machine learning training process

Mingtao Hu

doi:10.54254/2755-2721/50/20241097

Abstract

In contemporary machine learning, training datasets are typically divided into batches, and models are updated incrementally through batch iterations to save memory and reduce overfitting. However, determining the optimal hyperparameters like learning rate, batch size and number of epochs remains a challenge which often relying on empirical insights. This paper explores a novel method called Adaptive Gradient Conflict Rate (AdaGCR) to optimize the training process. It leverages the idea of gradient conflict rate, which reflects the models position within a batch model set and accordingly adjusts the global learning rate. This proposed method is tested by training a Deep Neural Network (DNN) model with MNIST dataset which represents simple tasks and a ResNet-18 model with CIFAR-10 dataset which represents more complicated tasks for solving real world problems. Experiments conducted on DNN demonstrates the proposed methods effectiveness in reducing overfitting and enhancing convergence, particularly with a well-suited initial learning rate. However, its applicability to more complex models like ResNet-18 may require further refinements, such as layer-specific learning rate adjustments. Future research should focus on fine-tuning AdaGCR and extending its utility across diverse machine learning models and tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AdaGCR: An improved method for optimizing machine learning training process

Abstract

Talk to us

Similar Papers

More From: Applied and Computational Engineering

Lead the way for us

Journal: Applied and Computational Engineering	Publication Date: Mar 25, 2024
License type: cc-by

Similar Papers

Solution-Phase DNA-Compatible Pictet-Spengler Reaction Aided by Machine Learning Building Block Filtering.
Ke Li ... Zhiqiang Duan
iScience | VOL. 23
Ke Li, et. al.Ke Li ... Zhiqiang Duan
07 May 2020
iScience | VOL. 23

A comparative evaluation of deep convolutional neural network and deep neural network-based land use/land cover classifications of mining regions using fused multi-sensor satellite data
Ajay Kumar ... Amit Kumar Gorai
Advances in Space Research | VOL. 72
Ajay Kumar, et. al.Ajay Kumar ... Amit Kumar Gorai
04 Sep 2023
Advances in Space Research | VOL. 72

Fast Robustness Prediction for Deep Neural Network
Yuehuan Wang ... Zenan Li
-
Yuehuan Wang, et. al.Yuehuan Wang ... Zenan Li
28 Oct 2019
28 Oct 2019

An optimization Strategy for Deep Neural Networks Training
Tingting Wu ... Peng Zeng
-
Tingting Wu, et. al.Tingting Wu ... Peng Zeng
28 Oct 2022
28 Oct 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AdaGCR: An improved method for optimizing machine learning training process

Abstract

Talk to us

Similar Papers

More From: Applied and Computational Engineering