A Design Flow for Scheduling Spiking Deep Convolutional Neural Networks on Heterogeneous Neuromorphic System-on-Chip

Anup Das

doi:10.1145/3635032

Abstract

Neuromorphic systems-on-chip (NSoCs) integrate CPU cores and neuromorphic hardware accelerators on the same chip. These platforms can execute spiking deep convolutional neural networks (SDCNNs) with a low energy footprint. Modern NSoCs are heterogeneous in terms of their computing, communication, and storage resources. This makes scheduling SDCNN operations a combinatorial problem of exploring an exponentially-large state space in determining mapping, ordering, and timing of operations to achieve a target hardware performance, e.g., throughput. We propose a systematic design flow to schedule SDCNNs on an NSoC. Our scheduler, called SMART ( S DCNN MA pping, Orde R ing, and T iming), branches the combinatorial optimization problem into computationally-relaxed sub-problems that generate fast solutions without significantly compromising the solution quality. SMART improves performance by efficiently incorporating the heterogeneity in computing, communication, and storage resources. SMART operates in four steps. First, it creates a self-timed execution schedule to map operations to compute resources, maximizing throughput. Second, it uses an optimization strategy to distribute activation and synaptic weights to storage resources, minimizing data communication-related overhead. Third, it constructs an inter-processor communication (IPC) graph with a transaction order for its communication actors. This transaction order is created using a transaction partial order algorithm, which minimizes contention on the shared communication resources. Finally, it schedules this IPC graph to hardware by overlapping communication with the computation, and leveraging operation, pipeline, and batch parallelism. We evaluate SMART using 10 representative image, object, and language-based SDCNNs. Results show that SMART increases throughput by an average 23%, compared to a state-of-the-art scheduler. SMART is implemented entirely in software as a compiler extension. It doesn’t require any change in a neuromorphic hardware or its interface to CPUs. It improves throughput with only a marginal increase in the compilation time. SMART is released under the open-source MIT licensing at https://github.com/drexel-DISCO/SMART to foster future research.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Design Flow for Scheduling Spiking Deep Convolutional Neural Networks on Heterogeneous Neuromorphic System-on-Chip

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Embedded Computing Systems

Lead the way for us

Similar Papers

DAARM
Andreas Weichslgartner ... Michael Glaß
-
Andreas Weichslgartner, et. al.Andreas Weichslgartner ... Michael Glaß
12 Oct 2014
12 Oct 2014

State-based real-time analysis of SDF applications on MPSoCs with shared communication resources
Maher Fakih ... Achim Rettberg
Journal of Systems Architecture | VOL. 61
Maher Fakih, et. al.Maher Fakih ... Achim Rettberg
24 Apr 2015
Journal of Systems Architecture | VOL. 61

Analysis of distributed control systems with shared communication and computation resources
Payam Naghshtabrizi ... Joao P Hespanha
-
Payam Naghshtabrizi, et. al.Payam Naghshtabrizi ... Joao P Hespanha
01 Jan 2009
01 Jan 2009

Approximation algorithms for data-intensive service chain embedding
Konstantinos Poularakis ... Antonia M Tulino
-
Konstantinos Poularakis, et. al.Konstantinos Poularakis ... Antonia M Tulino
11 Oct 2020
11 Oct 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Design Flow for Scheduling Spiking Deep Convolutional Neural Networks on Heterogeneous Neuromorphic System-on-Chip

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Embedded Computing Systems