Photonic Reconfigurable Accelerators for Efficient Inference of CNNs With Mixed-Sized Tensors

Sairam Sri Vatsavai,Ishan G Thakkar

doi:10.1109/tcad.2022.3197538

Abstract

Photonic microring resonator (MRR)-based hardware accelerators have been shown to provide disruptive speedup and energy-efficiency improvements for processing deep convolutional neural networks (CNNs). However, previous MRR-based CNN accelerators fail to provide efficient adaptability for CNNs with mixed-sized tensors. One example of such CNNs is depthwise separable CNNs. Performing inferences of CNNs with mixed-sized tensors on such inflexible accelerators often leads to low hardware utilization, which diminishes the achievable performance and energy efficiency from the accelerators. In this article, we present a novel way of introducing reconfigurability in the MRR-based CNN accelerators, to enable dynamic maximization of the size compatibility between the accelerator hardware components and the CNN tensors that are processed using the hardware components. We classify the state-of-the-art MRR-based CNN accelerators from prior works into two categories, based on the layout and relative placements of the utilized hardware components in the accelerators. We then use our method to introduce reconfigurability in accelerators from these two classes, to consequently improve their parallelism, the flexibility of efficiently mapping tensors of different sizes, speed, and overall energy efficiency. We evaluate our reconfigurable accelerators against three prior works for the area proportionate outlook (equal hardware area for all accelerators). Our evaluation for the inference of four modern CNNs indicates that our designed reconfigurable CNN accelerators provide improvements of up to <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$1.8\times $ </tex-math></inline-formula> in frames-per-second (FPS) and up to <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$1.5\times $ </tex-math></inline-formula> in FPS/W, compared to an MRR-based accelerator from prior work.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Photonic Reconfigurable Accelerators for Efficient Inference of CNNs With Mixed-Sized Tensors

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Lead the way for us

Journal: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems	Publication Date: Nov 1, 2022
Citations: 4

Similar Papers

CPU-Accelerator Co-Scheduling for CNN Acceleration at the Edge
Yeongmin Kim ... Arslan Munir
IEEE Access | VOL. 8
Yeongmin Kim, et. al.Yeongmin Kim ... Arslan Munir
01 Jan 2020
IEEE Access | VOL. 8

Revealing CNN Architectures via Side-Channel Analysis in Dataflow-based Inference Accelerators
Hansika Weerasena ... Prabhat Mishra
ACM Transactions on Embedded Computing Systems | VOL. -
Hansika Weerasena, et. al.Hansika Weerasena ... Prabhat Mishra
12 Aug 2024
ACM Transactions on Embedded Computing Systems | VOL. -

SPRING: A Sparsity-Aware Reduced-Precision Monolithic 3D CNN Accelerator Architecture for Training and Inference
Ye Yu ... Niraj K. Jha
IEEE Transactions on Emerging Topics in Computing | VOL. 10
Ye Yu, et. al.Ye Yu ... Niraj K. Jha
24 Jun 2020
IEEE Transactions on Emerging Topics in Computing | VOL. 10

ATRIA: A Bit-Parallel Stochastic Arithmetic Based Accelerator for In-DRAM CNN Processing
Supreeth Mysore Shivanandamurthy ... Ishan G Thakkar
-
Supreeth Mysore Shivanandamurthy, et. al.Supreeth Mysore Shivanandamurthy ... Ishan G Thakkar
01 Jul 2021
01 Jul 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Photonic Reconfigurable Accelerators for Efficient Inference of CNNs With Mixed-Sized Tensors

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems