Sparse Matrix Classification on Imbalanced Datasets Using Convolutional Neural Networks

Juan C Pichel, Beatriz Pateiro-Lopez

doi:10.1109/access.2019.2924060

Open Access

https://doi.org/10.1109/access.2019.2924060

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 35	License type: CC BY 4.0

Affiliation: University of Santiago de Compostela

Abstract

This paper deals with the class imbalance problem in the context of automatically selecting the best storage format for a sparse matrix, with the aim of maximizing the performance of sparse matrix-vector multiplication (SpMV) on GPUs. Our classification method uses convolutional neural networks (CNNs) and proposes several solutions to mitigate the bias toward the majority classes when the data are not balanced. First, the CNNs are trained using images that represent the sparsity pattern of the matrices, whose pixels are colored according to different matrix features. In addition, we introduce a new network called SpNet, which achieves better prediction accuracy than a standard network such as AlexNet despite having a simpler architecture. Finally, sampling techniques and cost-sensitive methods are studied to give more emphasis to minority classes. The experiments show that our classifiers select the best-performing format 92.8% of the time, obtaining 98.3% of the maximum attainable SpMV performance. A comparison with other state-of-the-art classification methods is also provided, demonstrating the benefits of our proposal.
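To make the idea of a cost-sensitive method concrete, the following is a minimal sketch of one common scheme, inverse-frequency class weighting, in which errors on rare classes are penalized more heavily. It is an illustration only: the abstract does not say which weighting the authors use, and the format names `CSR`, `ELL`, and `HYB` as well as the label counts are hypothetical placeholders rather than data from the paper.

```python
from collections import Counter


def inverse_frequency_weights(labels):
    """Return one weight per class, w_c = N / (K * n_c).

    N is the number of samples, K the number of classes and n_c the
    number of samples in class c. Rare classes get weights above 1,
    frequent classes get weights below 1.
    """
    counts = Counter(labels)
    total = len(labels)
    num_classes = len(counts)
    return {cls: total / (num_classes * n) for cls, n in counts.items()}


# Hypothetical, made-up label distribution (not taken from the paper):
# one majority class and two progressively rarer classes.
labels = ["CSR"] * 6 + ["ELL"] * 3 + ["HYB"] * 1
weights = inverse_frequency_weights(labels)

for cls in sorted(weights):
    print(cls, round(weights[cls], 3))
```

Weights of this kind are typically passed to the loss function during training, so that a misclassified minority-class matrix contributes more to the loss than a misclassified majority-class one.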

Highlights

Sparse matrix-vector multiplication (SpMV) is considered one of the most important computational kernels, lying at the heart of many scientific and engineering applications.
Given that SpMV performance depends on both the target parallel system and the sparsity structure of the matrix, many existing storage formats focus on a particular application domain, sparsity pattern and/or computer architecture.
In this paper we address the automatic classification of sparse matrices to select the best-performing storage format for SpMV on GPUs using convolutional neural networks (CNNs).
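As a reminder of what the SpMV kernel computes and why the storage format matters, here is a small sequential sketch using the widely known compressed sparse row (CSR) layout. CSR is chosen here only as a familiar example; this page does not list the formats compared in the paper, and the function name `spmv_csr` is our own.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A @ x for a matrix A stored in CSR form.

    row_ptr[i]:row_ptr[i + 1] is the slice of col_idx/values that
    holds the nonzeros of row i.
    """
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[k] * x[col_idx[k]]
    return y


# A = [[1, 0, 2],
#      [0, 0, 3],
#      [4, 5, 0]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 2, 0, 1]
values = [1.0, 2.0, 3.0, 4.0, 5.0]
x = [1.0, 2.0, 3.0]

y = spmv_csr(row_ptr, col_idx, values, x)
print(y)  # [7.0, 9.0, 14.0]
```

The inner loop length varies from row to row, which is one reason the sparsity pattern influences how well a given format maps onto parallel hardware.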

Summary

INTRODUCTION

Sparse matrix-vector multiplication (SpMV) is considered one of the most important computational kernels, lying at the heart of many scientific and engineering applications. We assume that a large set of sparse matrices, coming from different application domains and representing a variety of sparsity patterns, is available. This dataset is the input to the following phases: SpMV benchmarking and image generation. The CNN must be fed with a set of images labeled according to the best-performing storage format (the class of the matrix). This data is generated in the previous phases.

SpMV BENCHMARKING AND IMAGE DATASET GENERATION

Matrices must be labeled according to their best storage format (class) before training a network. This goal is achieved in the SpMV benchmarking phase. The datasets consist of 256×256 images, which corresponds to the input size for the AlexNet network.
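The two preparation steps described above, turning a matrix into a fixed-size image and labeling it with its fastest format, might be sketched as follows. This is a simplified illustration under stated assumptions: each pixel stores a count of the nonzeros that fall into it, which is only one possible matrix feature, and the function names, format names, and throughput figures are hypothetical placeholders, not the authors' code or measurements.

```python
def sparsity_image(n_rows, n_cols, nonzeros, size=256):
    """Bin nonzero coordinates of an n_rows x n_cols matrix into a
    size x size grid. Each pixel holds the number of nonzeros mapped
    to it (an assumed feature; other per-pixel features are possible).
    """
    image = [[0] * size for _ in range(size)]
    for r, c in nonzeros:
        pr = r * size // n_rows  # scale row index to pixel row
        pc = c * size // n_cols  # scale column index to pixel column
        image[pr][pc] += 1
    return image


def best_format(performance_by_format):
    """Return the format with the highest measured SpMV performance."""
    return max(performance_by_format, key=performance_by_format.get)


# Hypothetical 1024 x 1024 matrix with four nonzeros on the diagonal.
image = sparsity_image(1024, 1024, [(0, 0), (1, 1), (512, 512), (1023, 1023)])

# Placeholder throughput values, invented for illustration only.
label = best_format({"CSR": 10.2, "ELL": 14.7, "HYB": 12.1})
print(label, image[0][0], image[128][128], image[255][255])
```

Scaling every matrix to the same grid lets matrices of very different dimensions share one network input size, at the cost of merging neighboring nonzeros into a single pixel for large matrices.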

NETWORKS AND TRAINING PROCESS

ADDRESSING CLASS IMBALANCE

Findings

CONCLUSIONS
