A Sharper Generalization Bound for Divide-and-Conquer Ridge Regression

Shusen Wang

doi:10.1609/aaai.v33i01.33015305

Abstract

We study the distributed machine learning problem in which n feature-response pairs are partitioned among m machines uniformly at random. The goal is to approximately solve an empirical risk minimization (ERM) problem with the minimum amount of communication. The divide-and-conquer (DC) method, proposed several years ago, lets every worker machine independently solve the same ERM problem using its local feature-response pairs and lets the driver machine combine the solutions. This approach is one-shot and therefore extremely communication-efficient. Although the DC method has been studied in many prior works, no reasonable generalization bound had been established before this work. For the ridge regression problem, we show that the prediction error of the DC method on unseen test samples is at most 1 + ε times the optimal. Prior works established constant-factor bounds, but their sample complexities have a quadratic dependence on d, which does not match the setting of most real-world problems. In contrast, our bounds are much stronger. First, our 1 + ε error bound is much better than their constant-factor bounds. Second, our sample complexity is merely linear in d.
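
The one-shot procedure described in the abstract can be illustrated with a minimal NumPy sketch. Several details here are assumptions and are not taken from the paper: the driver combines the local solutions by plain averaging, each machine minimizes (1/n_i)||X_i w - y_i||^2 + gamma ||w||^2 over its own n_i samples, d is taken to be the number of features, and the data are synthetic Gaussian. The names ridge_solve, dc_ridge, and make_data are hypothetical helpers introduced only for illustration.

```python
import numpy as np


def ridge_solve(X, y, gamma):
    """Closed-form minimizer of (1/n)||Xw - y||^2 + gamma * ||w||^2.

    The 1/n scaling of the loss is an assumption of this sketch.
    """
    n, d = X.shape
    return np.linalg.solve(X.T @ X + n * gamma * np.eye(d), X.T @ y)


def dc_ridge(X, y, gamma, m, rng):
    """One-shot divide-and-conquer ridge regression (illustrative).

    Rows are partitioned among m simulated machines uniformly at random,
    each machine solves its local ridge problem, and the driver averages
    the m local solutions (averaging is an assumed combine rule).
    """
    n = X.shape[0]
    perm = rng.permutation(n)          # uniform random assignment of rows
    parts = np.array_split(perm, m)    # m nearly equal-sized partitions
    local = [ridge_solve(X[idx], y[idx], gamma) for idx in parts]
    return np.mean(local, axis=0)      # single round of communication


def make_data(n, d, rng, noise=0.1):
    """Synthetic linear-model data (hypothetical test setup)."""
    w_true = rng.standard_normal(d)
    X = rng.standard_normal((n, d))
    y = X @ w_true + noise * rng.standard_normal(n)
    return X, y, w_true


rng = np.random.default_rng(0)
n, d, m, gamma = 4000, 10, 8, 1e-3
X, y, w_true = make_data(n, d, rng)

w_dc = dc_ridge(X, y, gamma, m, rng)   # distributed, one-shot estimate
w_full = ridge_solve(X, y, gamma)      # centralized solution on all n samples

# Prediction error on unseen test inputs, measured against the noiseless signal.
X_test = rng.standard_normal((1000, d))
err_dc = np.mean((X_test @ (w_dc - w_true)) ** 2)
err_full = np.mean((X_test @ (w_full - w_true)) ** 2)
print(f"DC error: {err_dc:.3e}, centralized error: {err_full:.3e}")
```

Comparing err_dc with err_full on such synthetic data gives an informal feel for how close the one-shot estimate can be to the centralized one; it does not reproduce or verify the bounds stated in the abstract.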


Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jul 17, 2019
Citations: 7

