Large-scale machine learning with fast and stable stochastic conjugate gradient

Zhuang Yang

doi:10.1016/j.cie.2022.108656

Abstract

In deterministic optimization, conjugate gradient (CG) type approaches are often preferred because they converge faster than ordinary gradient approaches. The need to handle large-scale, exponentially growing data has led recent work to study the effectiveness of CG-type approaches with stochastic approximation, especially for large-scale machine learning problems. However, incorporating noisy gradients into CG-type approaches is challenging. In this paper, we develop a class of fast and robust stochastic conjugate gradient (SCG) type approaches that use the stochastic recursive gradient algorithm (SARAH) and the hyper-gradient descent (HD) technique in the mini-batch setting. The low variance of the SARAH gradient estimator accelerates convergence and reduces the gradient complexity relative to the conventional SCG-type approach. In addition, using HD to determine the learning rate greatly reduces the computational burden compared with existing methods, which usually rely on line search in practice. We rigorously prove that the proposed approach attains a linear convergence rate for strongly convex loss functions and show that its complexity matches that of modern stochastic optimization approaches. Experimental results on machine learning problems demonstrate the properties and the effectiveness of the proposed approaches.
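The three ingredients named in the abstract (a SARAH gradient estimator, a CG-type search direction, and a learning rate adapted by hyper-gradient descent) can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract does not give the update rules, so the ridge-regression objective, the Fletcher-Reeves-style coefficient, its clipping to `beta_max`, the restart test, the step-size bounds, and every function and parameter name below are assumptions chosen for illustration.

```python
import numpy as np

def ridge_grad(w, X, y, lam):
    """Gradient of 1/(2m)*||Xw - y||^2 + lam/2*||w||^2 over the given rows."""
    return X.T @ (X @ w - y) / X.shape[0] + lam * w

def ridge_loss(w, X, y, lam):
    r = X @ w - y
    return 0.5 * np.mean(r ** 2) + 0.5 * lam * (w @ w)

def scg_sarah_hd(X, y, lam=1e-3, epochs=5, inner_steps=20, batch=32,
                 alpha0=0.05, kappa=1e-3, alpha_bounds=(1e-4, 0.2),
                 beta_max=0.5, seed=0):
    """Illustrative SCG loop; all hyperparameters here are hypothetical."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    alpha = alpha0
    losses = [ridge_loss(w, X, y, lam)]
    for _ in range(epochs):
        # Outer loop: anchor the SARAH estimator with one full gradient.
        v = ridge_grad(w, X, y, lam)
        direction = -v
        for _ in range(inner_steps):
            w_prev, v_prev = w, v
            w = w + alpha * direction
            idx = rng.choice(n, size=batch, replace=False)
            Xb, yb = X[idx], y[idx]
            # SARAH recursion: v_t = g_B(w_t) - g_B(w_{t-1}) + v_{t-1}.
            v = ridge_grad(w, Xb, yb, lam) - ridge_grad(w_prev, Xb, yb, lam) + v_prev
            # Hyper-gradient step on alpha: the derivative of the loss at
            # w_prev + alpha*direction w.r.t. alpha is grad . direction;
            # the SARAH estimate v stands in for the gradient (assumption).
            alpha = float(np.clip(alpha - kappa * (v @ direction), *alpha_bounds))
            # Fletcher-Reeves-style coefficient, clipped for stability (assumption).
            beta = min(beta_max, (v @ v) / (v_prev @ v_prev + 1e-12))
            new_direction = -v + beta * direction
            # Restart with the steepest-descent direction if the CG
            # direction is not a descent direction for the estimate v.
            if new_direction @ v >= 0:
                new_direction = -v
            direction = new_direction
        losses.append(ridge_loss(w, X, y, lam))
    return w, alpha, losses

# Synthetic strongly convex problem (hypothetical data).
data_rng = np.random.default_rng(0)
X = data_rng.standard_normal((256, 10))
w_true = data_rng.standard_normal(10)
y = X @ w_true + 0.1 * data_rng.standard_normal(256)

w, alpha, losses = scg_sarah_hd(X, y)
print("initial loss:", losses[0], "final loss:", losses[-1], "final alpha:", alpha)
```

The sketch replaces a line search with a single dot product per step to adjust the learning rate, which is the kind of saving the abstract attributes to HD. The clipping and restart safeguards are conservative choices made here so the example stays stable, not features taken from the paper.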

Journal: Computers & Industrial Engineering
Publication Date: Sep 13, 2022
Citations: 7

Similar Papers

Adaptive stochastic conjugate gradient for machine learning
Zhuang Yang
Expert Systems with Applications | VOL. 206
09 Jun 2022

A Family of Hybrid Stochastic Conjugate Gradient Algorithms for Local and Global Minimization Problems
Khalid Abdulaziz Alnowibet ... Salem Mahdi
Mathematics | VOL. 10
01 Oct 2022

A Blind Equalization Algorithm Based on Global Artificial Fish Swarm and Genetic Optimization DNA Encoding Sequences
Hui Wang ... Yecai Guo
01 Jan 2015

SARAH-M: A fast stochastic recursive gradient descent algorithm via momentum
Zhuang Yang
Expert Systems With Applications | VOL. 238
31 Oct 2023
