Kernel-based online gradient descent using distributed approach

Xiaming Chen

doi:10.3934/mfc.2019001

Abstract

In this paper we study the kernel-based online gradient descent with least squares loss without an explicit regularization term. Our approach is novel by controlling the expectation of the K-norm of \begin{document}$ f_t $\end{document} using an iterative process. Then we use distributed learning to improve our result.

Highlights

Different from the classical batch learning which learns from the entire data set, online learning seeks to learn from a data set with an increasing size
The online gradient descent algorithm is defined in the following way: f1 = 0, (2)
In the distributed learning we divide our source of data into J different subsets and we use the online gradient descent algorithm for each subset of data

Summary

Introduction

Different from the classical batch learning which learns from the entire data set, online learning seeks to learn from a data set with an increasing size. The gradient descent method is a powerful algorithm designed to find the optimal value of a function, and online gradient descent is an adaptation to the online scheme. The online gradient descent algorithm has been studied in [9, 15] recently. In [14], the early stopping approach for batch learning is studied. In [9], the author studied an online gradient descent algorithm with a regularized term λft, which can be formulated as follows: f1 = 0,. We call λ the regularization parameter and when λ > 0, the algorithm is called online regularized learning and it has been well studied in [7, 9, 16].

XIAMING CHEN

By taking the average ft

Let the Online

Assume fρ

It holds that

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Mathematical Foundations of Computing	Publication Date: Jan 1, 2019
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

Kernel-based online gradient descent using distributed approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Mathematical Foundations of Computing

Lead the way for us

Similar Papers

Online Gradient Descent Learning Algorithms
Yiming Ying ... Massimiliano Pontil
Foundations of Computational Mathematics | VOL. 8
Yiming Ying, et. al.Yiming Ying ... Massimiliano Pontil
25 Apr 2007
Foundations of Computational Mathematics | VOL. 8

GrOD : Deep Learning with Gradients Orthogonal Decomposition for Knowledge Transfer, Distillation, and Adversarial Training
Haoyi Xiong ... Jun Huan
ACM Transactions on Knowledge Discovery from Data | VOL. 16
Haoyi Xiong, et. al.Haoyi Xiong ... Jun Huan
08 Sep 2022
ACM Transactions on Knowledge Discovery from Data | VOL. 16

Complexity control by gradient descent in deep networks
Tomaso Poggio ... Qianli Liao
Nature Communications | VOL. 11
Tomaso Poggio, et. al.Tomaso Poggio ... Qianli Liao
24 Feb 2020
Nature Communications | VOL. 11

Assessing the Impact of Different Types of Time-lapse Seismic Data on Permeability Estimation
T Feng ... T Mannseth
-
T Feng, et. al.T Feng ... T Mannseth
06 Sep 2010
06 Sep 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Kernel-based online gradient descent using distributed approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Mathematical Foundations of Computing