Accelerated Distributed Approximate Newton Method.

Haishan Ye, Chaoyang He, Xiangyu Chang

doi:10.1109/tnnls.2022.3151736

Abstract

Distributed second-order optimization, an effective strategy for training large-scale machine learning systems, has been widely investigated because of its low communication complexity. However, existing distributed second-order algorithms, including distributed approximate Newton (DANE), accelerated inexact DANE (AIDE), and statistically preconditioned accelerated gradient (SPAG), all require solving an expensive subproblem up to the target precision. As a result, these algorithms suffer from high computation costs, which hinders their development. In this article, we design a novel distributed second-order algorithm, the accelerated distributed approximate Newton (ADAN) method, to overcome the high computation costs of the existing ones. Whereas DANE, AIDE, and SPAG are built on relative smoothness theory, ADAN's theoretical foundation is inexact Newton theory. This different foundation allows ADAN to handle the expensive subproblem efficiently: the number of steps required to solve the subproblem is independent of the target precision. At the same time, ADAN uses acceleration and effectively exploits the objective function's curvature information, which allows it to achieve low communication complexity. Thus, ADAN achieves both communication and computation efficiency, whereas DANE, AIDE, and SPAG achieve only communication efficiency. Our empirical study also validates the advantages of ADAN over existing distributed second-order algorithms.

Journal: IEEE Transactions on Neural Networks and Learning Systems
Publication Date: Nov 1, 2023
Citations: 3

Similar Papers

Time-Varying Optimization and Its Application to Power System Operation
01 Jan 2019

Comparison of Newton's and quasi-Newton's method solvers for the Navier-Stokes equations
Paul D Orkwis
AIAA Journal | VOL. 31
01 May 1993

Distributed Approximate Newton's Method Robust to Byzantine Attackers
Xinyang Cao ... Lifeng Lai
IEEE Transactions on Signal Processing | VOL. 68
01 Jan 2020

Communication-efficient distributed cubic Newton with compressed lazy Hessian
Zhen Zhang ... Wenying Xu
Neural Networks | VOL. 174
27 Feb 2024
