Input Sparse Matrix Research Articles

The Sparse Matrix-Vector Multiplication (SpMV) kernel dominates the computing cost in numerous scientific applications. Many implementations based on different sparse formats have been proposed to improve this kernel on recent GPU architectures. However, it has been widely observed that there is no “best-for-all” sparse format for the SpMV kernel on GPU. Indeed, performance can degrade by an order of magnitude if the sparse format is not selected carefully. To address this problem, we propose BestSF (Best Sparse Format), a new learning-based sparse meta-format that automatically selects the most appropriate sparse format for a given input matrix. BestSF relies on a cost-sensitive classification system trained using Weighted Support Vector Machines (WSVMs) to predict the best sparse format for each input sparse matrix. Our experimental results on two different NVIDIA GPU architectures, using a large number of real-world sparse matrices, show that BestSF achieved a noticeable overall performance improvement over using a single sparse format. While BestSF is trained to select the best sparse format in terms of performance (GFLOPS), our further experiments revealed that using BestSF also led, in most of the test cases, to the best energy efficiency (MFLOPS/W). To demonstrate its practical effectiveness, we also evaluate the performance and energy efficiency improvement achieved when using BestSF as a building block in a GPU-based Preconditioned Conjugate Gradient (PCG) iterative solver.
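
To make the notion of a "sparse format" concrete, the following is a minimal CPU-side sketch in plain Python of the same SpMV product computed from two common storage formats, CSR and COO. It is only an illustration: it is not BestSF or any GPU kernel, and the function names (`dense_to_csr`, `dense_to_coo`, `spmv_csr`, `spmv_coo`) are hypothetical names chosen for this example.

```python
def dense_to_csr(dense):
    """Convert a dense matrix (list of rows) into CSR arrays."""
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for j, v in enumerate(row):
            if v != 0:
                values.append(v)
                col_idx.append(j)
        # row_ptr[i + 1] marks where row i ends in values/col_idx.
        row_ptr.append(len(values))
    return values, col_idx, row_ptr


def dense_to_coo(dense):
    """Convert a dense matrix into a COO list of (row, col, value) triples."""
    return [(i, j, v)
            for i, row in enumerate(dense)
            for j, v in enumerate(row)
            if v != 0]


def spmv_csr(values, col_idx, row_ptr, x):
    """Compute y = A @ x with A stored in CSR (row-by-row traversal)."""
    y = [0.0] * (len(row_ptr) - 1)
    for i in range(len(y)):
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[k] * x[col_idx[k]]
    return y


def spmv_coo(entries, n_rows, x):
    """Compute y = A @ x with A stored in COO (one update per nonzero)."""
    y = [0.0] * n_rows
    for i, j, v in entries:
        y[i] += v * x[j]
    return y


# Small illustrative matrix (made up for this example).
A = [[4.0, 0.0, 1.0],
     [0.0, 3.0, 0.0],
     [2.0, 0.0, 5.0]]
x = [1.0, 2.0, 3.0]

y_csr = spmv_csr(*dense_to_csr(A), x)
y_coo = spmv_coo(dense_to_coo(A), len(A), x)
print(y_csr, y_coo)
```

Both formats produce the same vector; they differ in memory layout and traversal order, which is the kind of difference that affects performance on a given matrix and architecture.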


Minimal elimination orderings were introduced by Rose, Tarjan, and Lueker in 1976, and during the last decade they have received increasing attention. Such orderings have important applications in several different fields, and they were first studied in connection with minimizing fill in sparse matrix computations. Rather than computing any minimal ordering, which might result in fill that is far from minimum, it is more desirable for practical applications to start from an ordering produced by a fill-reducing heuristic and then compute a minimal fill that is a subset of the fill produced by the given heuristic. This problem has been addressed previously, and there are several algorithms for solving it. The drawback of these algorithms is that either no theoretical bound is given on their running time, although they might run fast in practice, or they have a good theoretical running time but have never been implemented, or they require a large machinery of complicated data structures to achieve the good theoretical time bound. In this paper, we present an algorithm called MCS-ETree for solving this problem in $O(nm A(m,n))$ time, where $m$ and $n$ are, respectively, the number of edges and vertices of the graph corresponding to the input sparse matrix and $A(m,n)$ is the very slowly growing inverse of Ackermann's function. A primary strength of MCS-ETree is its simplicity and its straightforward implementation details. We present runtime test results to show that our algorithm is fast in practice. Thus our algorithm is the first that both has a provably good running time with easy implementation details and is fast in practice.
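
As background for the notion of fill, the sketch below plays the classic "elimination game" on an undirected graph: eliminating a vertex turns its remaining neighbors into a clique, and every edge added that way is a fill edge. This is not MCS-ETree and it does not compute a minimal fill; it only shows how the fill depends on the elimination ordering. The function name `fill_edges` and the star-graph example are assumptions made for this illustration.

```python
from itertools import combinations


def fill_edges(adjacency, ordering):
    """Return the set of fill edges created by eliminating vertices in order.

    adjacency maps each vertex to the set of its neighbors (undirected graph).
    """
    # Work on a copy so the caller's graph is left untouched.
    adj = {v: set(nbrs) for v, nbrs in adjacency.items()}
    fill = set()
    for v in ordering:
        # Neighbors of v that have not been eliminated yet.
        nbrs = adj.pop(v)
        for u in nbrs:
            adj[u].discard(v)
        # Make the remaining neighbors pairwise adjacent; new edges are fill.
        for a, b in combinations(sorted(nbrs), 2):
            if b not in adj[a]:
                adj[a].add(b)
                adj[b].add(a)
                fill.add((a, b))
    return fill


# Star graph: vertex 0 is the center, vertices 1..3 are leaves.
star = {0: {1, 2, 3}, 1: {0}, 2: {0}, 3: {0}}

fill_center_first = fill_edges(star, [0, 1, 2, 3])
fill_leaves_first = fill_edges(star, [1, 2, 3, 0])
print(len(fill_center_first), len(fill_leaves_first))
```

Eliminating the center first connects all three leaves to each other, while eliminating the leaves first creates no fill at all, which is why the choice of ordering matters in sparse factorization.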



Articles published on Input Sparse Matrix

Newly Released Capabilities in the Distributed-Memory SuperLU Sparse Direct Solver

Convolutional neural nets for estimating the run time and energy consumption of the sparse matrix-vector product

Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms

BestSF

Design and Implementation of Adaptive SpMV Library for Multicore and Many-Core Architecture

SMAT

Fast Computation of Minimal Fill Inside A Given Elimination Ordering

Approaches Based on Permutations for Partitioning Sparse Matrices on Multiprocessors

Solving sparse linear systems of equations using the modified digraph approach

Implementation and evaluation of a communication intensive application on the EARTH multithreaded system

