Evaluation of DGEMM Implementation on Intel Xeon Phi Coprocessor

Pawel Gepner,Eric Houdard,Damien Declat,Victor Gamayunov,David L Fraser,Mathieu Dubois,Ludovic Sauge

doi:10.4304/jcp.9.7.1566-1571

Evaluation of DGEMM Implementation on Intel Xeon Phi Coprocessor

Pawel Gepner, Eric Houdard + Show 5 more

https://doi.org/10.4304/jcp.9.7.1566-1571

Copy DOI

Journal: Journal of Computers	Publication Date: Jan 7, 2014
Citations: 5

#Intel Xeon Phi #Intel Math Kernel Library + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

In this paper we will present a detailed study of implementing double-precision matrix-matrix multiplication (DGEMM) utilizing the Intel Xeon Phi Coprocessor. We discuss a DGEMM algorithm implementation running on the coprocessor, minimizing communication with the host CPU. We will run DGEMM across a range of matrix sizes natively as well using Intel Math Kernel Library. Our optimizations were designed to support maximal reuse of on-die cache, which significantly reduces transfer from GDDR. Finally we analyze the improvement of a classic matrix multiplication implementation based on Cauchy algorithm compared to the latest results achieved using the Intel Math Kernel Library DGEMM subroutine.

Full Text