ACMo: Angle-Calibrated Moment Methods for Stochastic Optimization

Xunpeng Huang,Runxin Xu,Hao Zhou,Zhe Wang,Zhengyang Liu,Lei Li

doi:10.1609/aaai.v35i9.16959

Abstract

Stochastic gradient descent (SGD) is a widely used method for its outstanding generalization ability and simplicity. Adaptive gradient methods have been proposed to further accelerate the optimization process. In this paper, we revisit existing adaptive gradient optimization methods with a new interpretation. Such new perspective leads to a refreshed understanding of the roles of second moments in stochastic optimization. Based on this, we propose Angle-Calibration Moment method (ACMo), a novel stochastic optimization method. It enjoys the benefits of second moments with only first moment updates. Theoretical analysis shows that ACMo is able to achieve the same convergence rate as mainstream adaptive methods. Experiments on a variety of CV and NLP tasks demonstrate that ACMo has a comparable convergence to state-of-the-art Adam-type optimizers, and even a better generalization performance in most cases. The code is available at https://github.com/Xunpeng746/ACMo.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ACMo: Angle-Calibrated Moment Methods for Stochastic Optimization

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: May 18, 2021
Citations: 1

Similar Papers

Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks
Jinghui Chen ... Quanquan Gu
-
Jinghui Chen, et. al.Jinghui Chen ... Quanquan Gu
21 Dec 2018
21 Dec 2018

An automatic learning rate decay strategy for stochastic gradient descent optimization methods in neural networks
Kang Wang ... Dong Wen
International Journal of Intelligent Systems | VOL. 37
Kang Wang, et. al.Kang Wang ... Dong Wen
31 Mar 2022
International Journal of Intelligent Systems | VOL. 37

Robustness of Adaptive Neural Network Optimization Under Training Noise
Subhajit Chaudhury ... Toshihiko Yamasaki
IEEE access : practical innovations, open solutions | VOL. 9
Subhajit Chaudhury, et. al.Subhajit Chaudhury ... Toshihiko Yamasaki
01 Jan 2020
IEEE access : practical innovations, open solutions | VOL. 9

Solving Stochastic Compositional Optimization is Nearly as Easy as Solving Stochastic Optimization
Tianyi Chen ... Yuejiao Sun
IEEE Transactions on Signal Processing | VOL. 69
Tianyi Chen, et. al.Tianyi Chen ... Yuejiao Sun
01 Jan 2020
IEEE Transactions on Signal Processing | VOL. 69

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ACMo: Angle-Calibrated Moment Methods for Stochastic Optimization

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence