Sparse matrix multiplication: The distributed block-compressed sparse row library

Urban Borštnik, Jürg Hutter, Joost VandeVondele, Valéry Weber

doi:10.1016/j.parco.2014.03.012

Open Access

https://doi.org/10.1016/j.parco.2014.03.012

Journal: Parallel Computing
Publication Date: Apr 1, 2014
Citations: 170
License type: other-oa

Affiliation: University of Zurich, ETH Zurich

Abstract

Efficient parallel multiplication of sparse matrices is key to enabling many large-scale calculations. This article presents the DBCSR (Distributed Block Compressed Sparse Row) library for scalable sparse matrix–matrix multiplication and its use in the CP2K program for linear-scaling quantum-chemical calculations. The library combines several approaches to implement sparse matrix multiplication in a way that performs well and is demonstrably scalable. Parallel communication has well-defined limits. Data volume decreases with O(1/√P) with increasing process counts P and every process communicates with at most O(√P) others. Local sparse matrix multiplication is handled efficiently using a combination of techniques: blocking elements together in an application-relevant way, an autotuning library for small matrix multiplications, cache-oblivious recursive multiplication, and multithreading. Additionally, on-the-fly filtering not only increases sparsity but also avoids performing calculations that fall below the filtering threshold. We demonstrate and analyze the performance of the DBCSR library and its various scaling behaviors.
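
To make the blocking and on-the-fly filtering ideas concrete, the following is a minimal single-process sketch in plain Python. It is not the DBCSR API or the authors' implementation: the function names (`bcsr_from_dict`, `bcsr_multiply`), the `eps` parameter, and the filtering criterion are assumptions chosen for illustration. The sketch stores small dense blocks in block compressed sparse row form and skips a block product whenever the product of the two Frobenius norms, which bounds the norm of the result, falls below `eps`. Distribution over processes, autotuned small-matrix kernels, recursion, and threading are all left out.

```python
from math import sqrt


def block_matmul(a, b):
    """Dense product of two small blocks stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def block_add(a, b):
    """Element-wise sum of two blocks of equal shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def frobenius_norm(block):
    return sqrt(sum(x * x for row in block for x in row))


def bcsr_from_dict(n_block_rows, entries):
    """Build (row_ptr, col_idx, blocks) from {(block_row, block_col): block}."""
    row_ptr, col_idx, blocks = [0], [], []
    for i in range(n_block_rows):
        for key in sorted(k for k in entries if k[0] == i):
            col_idx.append(key[1])
            blocks.append(entries[key])
        row_ptr.append(len(col_idx))
    return row_ptr, col_idx, blocks


def bcsr_multiply(a, b, eps=0.0):
    """Multiply two block-CSR matrices, filtering small contributions.

    Assumed criterion (illustrative only): a block product is skipped when
    ||A_ik||_F * ||B_kj||_F < eps, and result blocks with norm < eps are dropped.
    """
    a_ptr, a_col, a_blk = a
    b_ptr, b_col, b_blk = b
    a_norm = [frobenius_norm(x) for x in a_blk]
    b_norm = [frobenius_norm(x) for x in b_blk]
    result = {}
    for i in range(len(a_ptr) - 1):
        for p in range(a_ptr[i], a_ptr[i + 1]):
            k = a_col[p]
            for q in range(b_ptr[k], b_ptr[k + 1]):
                # On-the-fly filtering: avoid the multiplication entirely.
                if a_norm[p] * b_norm[q] < eps:
                    continue
                j = b_col[q]
                prod = block_matmul(a_blk[p], b_blk[q])
                result[(i, j)] = block_add(result[(i, j)], prod) if (i, j) in result else prod
    # Final filtering keeps the result sparse.
    return {key: blk for key, blk in result.items() if frobenius_norm(blk) >= eps}


# Hypothetical 2x2 block matrices with 2x2 blocks; off-diagonal blocks are tiny.
a = bcsr_from_dict(2, {
    (0, 0): [[1, 0], [0, 1]],
    (0, 1): [[1e-6, 0], [0, 1e-6]],
    (1, 1): [[2, 0], [0, 2]],
})
b = bcsr_from_dict(2, {
    (0, 0): [[1, 2], [3, 4]],
    (1, 0): [[1e-6, 0], [0, 1e-6]],
    (1, 1): [[1, 1], [1, 1]],
})
c_filtered = bcsr_multiply(a, b, eps=1e-3)
c_full = bcsr_multiply(a, b, eps=0.0)
print(sorted(c_filtered), sorted(c_full))
```

In this toy input, filtering with `eps=1e-3` skips every product that involves a tiny off-diagonal block, so the filtered result keeps fewer blocks than the unfiltered one and fewer block multiplications are performed.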
