Distributed Kernel Matrix Approximation and Implementation Using Message Passing Interface

Taher A Dameh,Wael Abd-Almageed,Mohamed Hefeeda

doi:10.1109/icmla.2013.17

Abstract

We propose a distributed method to compute similarity (also known as kernel and Gram) matrices used in various kernel-based machine learning algorithms. Current methods for computing similarity matrices have quadratic time and space complexities, which make them not scalable to large-scale data sets. To reduce these quadratic complexities, the proposed method first partitions the data into smaller subsets using various families of locality sensitive hashing, including random project and spectral hashing. Then, the method computes the similarity values among points in the smaller subsets to result in approximated similarity matrices. We analytically show that the time and space complexities of the proposed method are sub quadratic. We implemented the proposed method using the Message Passing Interface (MPI) framework and ran it on a cluster. Our results with real large-scale data sets show that the proposed method does not significantly impact the accuracy of the computed similarity matrices and it achieves substantial savings in running time and memory requirements.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Distributed Kernel Matrix Approximation and Implementation Using Message Passing Interface

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Distributed approximate spectral clustering for large-scale datasets
Mohamed Hefeeda ... Fei Gao
-
Mohamed Hefeeda, et. al.Mohamed Hefeeda ... Fei Gao
18 Jun 2012
18 Jun 2012

Large scale nearest neighbor search -- theories, algorithms, and applications
...
-
, et. al. ...
01 Jan 2014
01 Jan 2014

Prediction of arch dam deformation via correlated multi-target stacking
Siyu Chen ... Mohammad Amin Hariri-Ardebili
Applied Mathematical Modelling | VOL. 91
Siyu Chen, et. al.Siyu Chen ... Mohammad Amin Hariri-Ardebili
31 Oct 2020
Applied Mathematical Modelling | VOL. 91

Scalable Discrete Supervised Hash Learning with Asymmetric Matrix Factorization
Shifeng Zhang ... Jinma Guo
-
Shifeng Zhang, et. al.Shifeng Zhang ... Jinma Guo
01 Dec 2016
01 Dec 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Distributed Kernel Matrix Approximation and Implementation Using Message Passing Interface

Abstract

Talk to us

Similar Papers