A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization With Partial Pivoting

Sandra Catalan,Jose R Herrero,Robert Van De Geijn,Enrique S Quintana-Orti,Rafael Rodriguez-Sanchez

doi:10.1109/access.2019.2895541

Sandra Catalan, Jose R Herrero + Show 3 more

Open Access

https://doi.org/10.1109/access.2019.2895541

Copy DOI

Abstract

We propose two novel techniques for overcoming load-imbalance encountered when implementing so-called look-ahead mechanisms in relevant dense matrix factorizations for the solution of linear systems. Both techniques target the scenario where two thread teams are created/activated during the factorization, with each team in charge of performing an independent task/branch of execution. The first technique promotes worker sharing (WS) between the two tasks, allowing the threads of the task that completes first to be reallocated for use by the costlier task. The second technique allows a fast task to alert the slower task of completion, enforcing the early termination (ET) of the second task, and a smooth transition of the factorization procedure into the next iteration. The two mechanisms are instantiated via a new malleable thread-level implementation of the basic linear algebra subprograms , and their benefits are illustrated via an implementation of the LU factorization with partial pivoting enhanced with look-ahead. Concretely, our experimental results on an Intel-Xeon system with 12 cores show the benefits of combining WS+ET, reporting competitive performance in comparison with a task-parallel runtime-based solution.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 12	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization With Partial Pivoting

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Software Libraries for Linear Algebra Computations on High Performance Computers
Jack J Dongarra ... David W Walker
SIAM Review | VOL. 37
Jack J Dongarra, et. al.Jack J Dongarra ... David W Walker
01 Jun 1995
SIAM Review | VOL. 37

The design of linear algebra libraries for high performance computers
J.J Dongarra ... D.W Walker
-
J.J Dongarra, et. al.J.J Dongarra ... D.W Walker
01 Aug 1993
01 Aug 1993

Algorithm, Architecture, and Floating-Point Unit Codesign of a Matrix Factorization Accelerator
Ardavan Pedram ... Andreas Gerstlauer
IEEE Transactions on Computers | VOL. 63
Ardavan Pedram, et. al.Ardavan Pedram ... Andreas Gerstlauer
01 Aug 2014
IEEE Transactions on Computers | VOL. 63

The Design and Implementation of the Reduction Routines in ScaLAPACK
...
Advances in Parallel Computing | VOL. 10
, et. al. ...
01 Jan 1995
Advances in Parallel Computing | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization With Partial Pivoting

Abstract

Talk to us

Similar Papers

More From: IEEE Access