Abstract
As machine learning workloads grow in scale, there is an increasing need for distributed systems that can execute machine learning algorithms on large clusters. Most current distributed machine learning systems are built on iterative optimization algorithms and the parameter server framework. However, most of these systems compute over all samples in every iteration, which consumes excessive computing resources because the number of samples is typically very large. In this paper, we study sample diversity and find that most samples contribute little to model updates during most iterations. Based on this finding, we propose a new iterative optimization algorithm that reduces the computation load by reusing results from previous iterations. Our experiments demonstrate that, compared with current methods, the proposed algorithm reduces the overall computation load by about 23% without increasing communication.
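The abstract does not give the algorithm itself, so the following is only a minimal Python sketch of the general idea under stated assumptions: a logistic-loss setting, a per-sample gradient cache, and a norm threshold for deciding when a sample's previous result is reused instead of recomputed. The function name, threshold rule, and hyperparameters are all illustrative, not the paper's actual method.

```python
import numpy as np

def train_with_gradient_reuse(X, y, epochs=20, lr=0.1, reuse_threshold=1e-3):
    """Illustrative sketch: samples whose cached per-sample gradient has a small
    norm are treated as low-contribution and their previous gradient is reused,
    skipping the forward/backward computation for that sample."""
    n, d = X.shape
    w = np.zeros(d)
    cached = np.full((n, d), np.inf)  # infinite norm forces computation on the first pass
    for epoch in range(epochs):
        recomputed = 0
        for i in range(n):
            if np.linalg.norm(cached[i]) < reuse_threshold:
                grad = cached[i]                        # reuse previous iteration's result
            else:
                p = 1.0 / (1.0 + np.exp(-X[i] @ w))     # logistic prediction
                grad = (p - y[i]) * X[i]                # per-sample gradient
                cached[i] = grad
                recomputed += 1
            w -= lr * grad
        print(f"epoch {epoch}: recomputed {recomputed}/{n} samples")
    return w

# Example usage on synthetic data (assumed setup for illustration only)
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 10))
y = (X @ rng.normal(size=10) > 0).astype(float)
w = train_with_gradient_reuse(X, y)
```

In a parameter server deployment, the same check would let a worker skip pulling fresh parameters and recomputing gradients for low-contribution samples, which is consistent with the abstract's claim that computation drops without additional communication.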