An Effective Hard Thresholding Method Based on Stochastic Variance Reduction for Nonconvex Sparse Learning

Guannan Liang, Qianqian Tong, Chunjiang Zhu, Jinbo Bi

doi:10.1609/aaai.v34i02.5519

Abstract

We propose a hard thresholding method based on stochastically controlled stochastic gradients (SCSG-HT) to solve a family of sparsity-constrained empirical risk minimization problems. To reduce the variance of stochastic gradients, SCSG-HT uses batch gradients, with the batch size pre-determined by the desired precision tolerance, rather than full gradients. It also uses a geometric distribution to determine the number of loops per epoch. We prove that, like the latest methods based on stochastic gradient descent or stochastic variance reduction, SCSG-HT enjoys a linear convergence rate. SCSG-HT also carries a strong guarantee of recovering the optimal sparse estimator. The computational complexity of SCSG-HT is independent of the sample size n when n is larger than 1/ε, which improves scalability to massive-scale problems. Empirical results show that SCSG-HT outperforms several competitors and achieves the largest decrease in objective value at the same computational cost.
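The three ingredients named in the abstract (a batch gradient in place of the full gradient, a geometrically distributed number of loops per epoch, and hard thresholding) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The least-squares loss, the function names, the choice of the batch size as the mean of the geometric draw, and the sampling of single inner examples from the whole data set are all assumptions made for illustration only.

```python
import math
import random

def hard_threshold(x, k):
    """Keep the k largest-magnitude entries of x and set the rest to zero."""
    keep = set(sorted(range(len(x)), key=lambda i: abs(x[i]), reverse=True)[:k])
    return [x[i] if i in keep else 0.0 for i in range(len(x))]

def batch_grad(A, y, x, idx):
    """Gradient of the mean of 0.5 * (a_i . x - y_i)^2 over rows in idx.
    Least squares is an assumed example loss, not taken from the paper."""
    g = [0.0] * len(x)
    for i in idx:
        r = sum(a * xj for a, xj in zip(A[i], x)) - y[i]
        for j, a in enumerate(A[i]):
            g[j] += r * a / len(idx)
    return g

def geometric(mean, rng):
    """Draw the number of failures before the first success,
    parameterized so that the expected value equals `mean`."""
    p = 1.0 / (1.0 + mean)
    u = 1.0 - rng.random()  # lies in (0, 1], so log(u) is defined
    return int(math.log(u) / math.log(1.0 - p))

def scsg_ht(A, y, k, batch_size, step, epochs, seed=0):
    """Hypothetical SCSG-style loop with a hard-thresholding step."""
    rng = random.Random(seed)
    n, d = len(A), len(A[0])
    x = [0.0] * d
    for _ in range(epochs):
        # Batch gradient at a snapshot replaces the full gradient.
        batch = rng.sample(range(n), min(batch_size, n))
        snapshot = x[:]
        g_batch = batch_grad(A, y, snapshot, batch)
        # Number of inner steps is geometric (assumed mean: the batch size).
        for _ in range(geometric(len(batch), rng)):
            i = [rng.randrange(n)]
            g_now = batch_grad(A, y, x, i)
            g_old = batch_grad(A, y, snapshot, i)
            # Variance-reduced direction, then projection onto k-sparse vectors.
            v = [a - b + c for a, b, c in zip(g_now, g_old, g_batch)]
            x = hard_threshold([xj - step * vj for xj, vj in zip(x, v)], k)
    return x
```

In this sketch the per-epoch cost depends on `batch_size` rather than on the number of rows in `A`, which mirrors the abstract's point that the batch size, not the sample size n, drives the computation. Calling `scsg_ht(A, y, k=2, batch_size=8, step=0.05, epochs=30)` on a small list-of-lists design matrix returns a vector with at most two nonzero entries.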

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Apr 3, 2020
Citations: 2

Similar Papers

Momentum and stochastic momentum for stochastic gradient, Newton, proximal point and subspace descent methods
Nicolas Loizou ... Peter Richtárik
23 Sep 2020
Computational Optimization and Applications | VOL. 77

Bi-fidelity stochastic gradient descent for structural optimization under uncertainty
Subhayan De ... Kurt Maute
03 Aug 2020
Computational Mechanics | VOL. 66

Kalman-Based Stochastic Gradient Method with Stop Condition and Insensitivity to Conditioning
Vivak Patel
01 Jan 2015
SIAM Journal on Optimization | VOL. 26

Numerical methods for distributed stochastic compositional optimization problems with aggregative structure
Shengchao Zhao ... Yongchao Liu
25 Jul 2024
Optimization Methods and Software | VOL. ahead-of-print
