A high performance implementation of Zolo-SVD algorithm on distributed memory systems

Shengguo Li,Jie Liu,Yunfei Du

doi:10.1016/j.parco.2019.04.004

Abstract

This paper introduces a high performance implementation of the Zolo-SVD algorithm on distributed memory systems, which is based on the polar decomposition (PD) algorithm via the Zolotarev’s function (Zolo-PD), originally proposed by Nakatsukasa and Freund [SIAM Review, 2016]. Our implementation highly relies on the routines of ScaLAPACK and therefore it is portable. Compared with the other PD algorithms such as the QR-based dynamically weighted Halley method (QDWH-PD), Zolo-PD is naturally parallelizable and has better scalability though performs more floating-point operations. When using many processors, Zolo-PD is usually 1.20 times faster than the QDWH-PD algorithm, and Zolo-SVD can be about two times faster than the ScaLAPACK routine PDGESVD. These numerical experiments are performed on Tianhe-2A supercomputer, one of the fastest supercomputers in the world, and the tested matrices include some sparse matrices from particular applications and some randomly generated dense matrices with different dimensions. Our QDWH-SVD and Zolo-SVD implementations are freely available at https://github.com/shengguolsg/Zolo-SVD.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A high performance implementation of Zolo-SVD algorithm on distributed memory systems

Abstract

Talk to us

Similar Papers

More From: Parallel Computing

Lead the way for us

Journal: Parallel Computing	Publication Date: Apr 22, 2019
Citations: 1

Similar Papers

High Performance Polar Decomposition on Distributed Memory Systems
Dalal Sukkari ... David Keyes
-
Dalal Sukkari, et. al.Dalal Sukkari ... David Keyes
01 Jan 2015
01 Jan 2015

Fast and highly scalable parallel computations for fundamental matrix problems on distributed memory systems
Keqin Li
The Journal of Supercomputing | VOL. 54
Keqin LiKeqin Li
29 Jul 2009
The Journal of Supercomputing | VOL. 54

Fast and Scalable Parallel Matrix Computations on Distributed Memory Systems
Keqin Li
-
Keqin Li Keqin Li
04 Apr 2005
04 Apr 2005

Massively Parallel Polar Decomposition on Distributed-memory Systems
Hatem Ltaief ... Dalal Sukkari
ACM Transactions on Parallel Computing | VOL. 6
Hatem Ltaief, et. al.Hatem Ltaief ... Dalal Sukkari
31 Mar 2019
ACM Transactions on Parallel Computing | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A high performance implementation of Zolo-SVD algorithm on distributed memory systems

Abstract

Talk to us

Similar Papers

More From: Parallel Computing