Abstract

Zeroth-order (ZO) optimization is a key technique for machine learning problems where gradient calculation is expensive or impossible. Several variance-reduced ZO proximal algorithms have been proposed to speed up ZO optimization for nonsmooth problems, and all of them opt for the coordinated ZO estimator over the random ZO estimator when approximating the true gradient, since the former is more accurate. While the random ZO estimator introduces a larger error and makes convergence analysis more challenging than the coordinated ZO estimator, it requires only O(1) computation, which is significantly less than the O(d) computation of the coordinated ZO estimator, with d being the dimension of the problem space. To take advantage of the computational efficiency of the random ZO estimator, we first propose a ZO objective decrease (ZOOD) property that can incorporate two different types of errors in the upper bound of the convergence rate. Next, we propose two generic reduction frameworks for ZO optimization, which can automatically derive the convergence results for convex and nonconvex problems, respectively, as long as the convergence rate of the inner solver satisfies the ZOOD property. By applying the two reduction frameworks to our proposed ZOR-ProxSVRG and ZOR-ProxSAGA, two variance-reduced ZO proximal algorithms with fully random ZO estimators, we improve the state-of-the-art function query complexities from O(min{dn^{1/2}/ε², d/ε³}) to Õ((n+d)/ε²) under d > n^{1/2} for nonconvex problems, and from O(d/ε²) to Õ(n log(1/ε) + d/ε) for convex problems. Finally, we conduct experiments to verify the superiority of our proposed methods.
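
To make the O(1) versus O(d) query-cost distinction concrete, the following minimal Python sketch contrasts a two-point random ZO gradient estimator with a coordinate-wise (coordinated) ZO estimator. The smoothing parameter mu, the unit-sphere direction distribution, and the quadratic test function are illustrative assumptions for this sketch, not necessarily the exact constructions used in the paper.

```python
import numpy as np

def random_zo_estimator(f, x, mu=1e-4, rng=None):
    """Two-point random ZO estimator: O(1) function queries per estimate."""
    rng = rng or np.random.default_rng()
    u = rng.standard_normal(x.shape)
    u /= np.linalg.norm(u)                     # random direction on the unit sphere
    return x.size * (f(x + mu * u) - f(x)) / mu * u

def coordinated_zo_estimator(f, x, mu=1e-4):
    """Coordinate-wise ZO estimator: O(d) function queries per estimate."""
    g = np.zeros_like(x)
    for i in range(x.size):
        e = np.zeros_like(x)
        e[i] = 1.0
        g[i] = (f(x + mu * e) - f(x - mu * e)) / (2 * mu)  # central difference along coordinate i
    return g

# Example: estimate the gradient of a simple quadratic at a random point.
f = lambda x: 0.5 * np.dot(x, x)               # true gradient is x itself
x0 = np.random.default_rng(0).standard_normal(20)
print(coordinated_zo_estimator(f, x0))         # accurate, but uses 2d queries
print(random_zo_estimator(f, x0))              # noisier, but uses only 2 queries
```

The random estimator is unbiased for the gradient of a smoothed surrogate of f but has much higher variance per query, which is exactly the error the ZOOD property and the reduction frameworks are designed to absorb.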
