Abstract
Training a sparse Deep Neural Network (DNN) is inherently less memory- and processor-intensive than training a dense (fully-connected) DNN. In this paper, we use Sparse Matrix-Matrix Multiplication (SpMM) to train sparsely-connected DNNs, as opposed to the dense matrix-matrix multiplication used for training dense DNNs. In our C/C++ implementation, we extensively use in-memory Compressed Sparse Column (CSC) data structures to store and traverse the neural network layers. We train the neural network layer by layer, and within each layer we use 1D-Column partitioning to divide the computation required for training among threads. To speed up the computation, we apply the bias and activation functions while executing the SpMM operations. We tested our implementation using the benchmarks provided by the MIT/IEEE/Amazon HPEC Graph Challenge [1]. Based on our results, our single-threaded (1 core) and multithreaded (12 cores) implementations are up to $22 \times$ and $150 \times$ faster, respectively, than the serial MATLAB results provided by the challenge. We attribute this speedup to the 1D-Column partitioning that balances the SpMM computation among threads, the efficient mechanism we use for memory (re)allocation of sparse matrices, and the overlapping of the accumulation of SpMM results with the application of the bias and activation functions.
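The abstract describes three ingredients: CSC storage of the layers, SpMM with the bias and activation fused into the accumulation, and 1D-Column partitioning of the work among threads. The C++ sketch below shows one way these pieces could fit together; it is illustrative only, not the paper's actual code. The `CSC` struct, the function name `spmm_bias_relu`, the per-column bias vector, and the ReLU saturation threshold of 32 (the variant used in the Graph Challenge benchmarks) are assumptions introduced here.

```cpp
#include <cstdint>
#include <vector>
#include <algorithm>

// Compressed Sparse Column storage: the nonzeros of column j occupy the
// range [col_ptr[j], col_ptr[j+1]) of row_idx and vals.
struct CSC {
    uint32_t nrows = 0, ncols = 0;
    std::vector<uint64_t> col_ptr;   // size ncols + 1
    std::vector<uint32_t> row_idx;   // size nnz
    std::vector<double>   vals;      // size nnz
};

// Fused SpMM + bias + activation over the column range [col_begin, col_end)
// of B, computing that slice of C = A * B.  Under 1D-Column partitioning,
// each thread calls this on its own column range and fills its own output
// CSC, so no locking is needed inside the loops.  Assumptions: the bias is
// non-positive (as in the Graph Challenge data), so rows never touched by
// the accumulation stay exactly zero, and the activation is the challenge's
// ReLU variant saturated at ymax.
CSC spmm_bias_relu(const CSC& A, const CSC& B, const std::vector<double>& bias,
                   uint32_t col_begin, uint32_t col_end, double ymax = 32.0) {
    CSC C;
    C.nrows = A.nrows;
    C.ncols = col_end - col_begin;
    C.col_ptr.push_back(0);

    std::vector<double>   acc(A.nrows, 0.0);  // dense accumulator for one column
    std::vector<char>     marked(A.nrows, 0); // which rows were written
    std::vector<uint32_t> touched;            // list of written rows

    for (uint32_t j = col_begin; j < col_end; ++j) {
        // Accumulate A(:,k) * B(k,j) for every nonzero B(k,j) of column j.
        for (uint64_t p = B.col_ptr[j]; p < B.col_ptr[j + 1]; ++p) {
            const uint32_t k = B.row_idx[p];
            const double   b = B.vals[p];
            for (uint64_t q = A.col_ptr[k]; q < A.col_ptr[k + 1]; ++q) {
                const uint32_t r = A.row_idx[q];
                if (!marked[r]) { marked[r] = 1; touched.push_back(r); }
                acc[r] += A.vals[q] * b;
            }
        }
        // Apply the bias and the saturated ReLU while compressing the column
        // back into CSC form, then reset the scratch buffers.
        std::sort(touched.begin(), touched.end());
        for (uint32_t r : touched) {
            const double y = std::min(std::max(acc[r] + bias[j], 0.0), ymax);
            if (y != 0.0) { C.row_idx.push_back(r); C.vals.push_back(y); }
            acc[r] = 0.0;
            marked[r] = 0;
        }
        touched.clear();
        C.col_ptr.push_back(static_cast<uint64_t>(C.vals.size()));
    }
    return C;
}
```

In this scheme each thread owns a disjoint range of output columns, so it can accumulate, apply the bias and activation, and compress its slice of the result without synchronization; the per-thread outputs would then be stitched together (with their col_ptr arrays offset) to form the input of the next layer.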