Abstract

Many scientific computing problems can be reduced to Matrix-Matrix Multiplications (MMM), making the General Matrix Multiply (GEMM) kernels in the Basic Linear Algebra Subprograms (BLAS) of particular interest to the high-performance computing community. However, these workloads have a wide range of numerical requirements. Ill-conditioned linear systems require high-precision arithmetic to ensure correct and reproducible results. In contrast, emerging workloads such as deep neural networks, which can have millions to billions of parameters, have shown resilience to reduced-precision arithmetic.
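For context, a GEMM kernel computes C ← αAB + βC. The sketch below illustrates this with the standard CBLAS interface in double precision (cblas_dgemm); the matrix sizes and values are illustrative only and are not taken from the paper.

```c
#include <stdio.h>
#include <cblas.h>

int main(void) {
    /* Illustrative 2x2 operands, stored row-major. */
    double A[4] = {1.0, 2.0, 3.0, 4.0};
    double B[4] = {5.0, 6.0, 7.0, 8.0};
    double C[4] = {0.0, 0.0, 0.0, 0.0};

    /* C <- 1.0 * A * B + 0.0 * C */
    cblas_dgemm(CblasRowMajor, CblasNoTrans, CblasNoTrans,
                2, 2, 2,        /* M, N, K           */
                1.0, A, 2,      /* alpha, A, lda     */
                B, 2,           /* B, ldb            */
                0.0, C, 2);     /* beta, C, ldc      */

    printf("%f %f\n%f %f\n", C[0], C[1], C[2], C[3]);
    return 0;
}
```

Lower-precision variants (e.g. cblas_sgemm for single precision) expose the same interface, which is why precision selection for GEMM can often be made without restructuring the surrounding code.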
