Abstract

In this paper, we present an optimized GPU implementation of the induced dimension reduction (IDR) algorithm. We improve data locality, combine the solver with an efficient sparse matrix-vector kernel, and investigate the potential of overlapping computation with communication as well as of concurrent kernel execution. A comprehensive performance evaluation is conducted using a suitable performance model. The analysis reveals efficiency of up to 90%, indicating that the implementation achieves performance close to the theoretically attainable bound.
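The overlap of computation with communication mentioned above is, on NVIDIA GPUs, conventionally realized with CUDA streams: a kernel launched in one stream can execute while an asynchronous transfer proceeds in another. The sketch below illustrates this standard mechanism only; it is not the paper's implementation, and the kernel name `axpy_kernel`, the problem size `n`, and the buffers are illustrative assumptions.

```cuda
// Minimal sketch of computation/communication overlap with two CUDA streams.
// Not the authors' code; axpy_kernel and all buffer names are hypothetical.
#include <cuda_runtime.h>
#include <stdio.h>

__global__ void axpy_kernel(int n, double a, const double *x, double *y) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) y[i] += a * x[i];  // y <- y + a*x
}

int main() {
    const int n = 1 << 20;
    double *h_buf, *d_x, *d_y, *d_buf;
    cudaMallocHost(&h_buf, n * sizeof(double));  // pinned host memory, required for truly async copies
    cudaMalloc(&d_x, n * sizeof(double));
    cudaMalloc(&d_y, n * sizeof(double));
    cudaMalloc(&d_buf, n * sizeof(double));

    cudaStream_t compute, copy;
    cudaStreamCreate(&compute);
    cudaStreamCreate(&copy);

    // Kernel in one stream, transfer in another: the copy engine and the
    // compute units can make progress concurrently.
    axpy_kernel<<<(n + 255) / 256, 256, 0, compute>>>(n, 2.0, d_x, d_y);
    cudaMemcpyAsync(h_buf, d_buf, n * sizeof(double),
                    cudaMemcpyDeviceToHost, copy);

    cudaDeviceSynchronize();  // wait for both streams before using results
    printf("done\n");

    cudaStreamDestroy(compute);
    cudaStreamDestroy(copy);
    cudaFreeHost(h_buf);
    cudaFree(d_x); cudaFree(d_y); cudaFree(d_buf);
    return 0;
}
```

The same stream mechanism also enables the concurrent kernel execution the abstract refers to: independent kernels issued into distinct streams may run simultaneously when resources permit.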
