STADIA: Photonic Stochastic Gradient Descent for Neural Network Accelerators

Chengpeng Xia,Haibo Zhang,Yawen Chen,Jigang Wu

doi:10.1145/3607920

Abstract

Deep Neural Networks (DNNs) have demonstrated great success in many fields such as image recognition and text analysis. However, the ever-increasing sizes of both DNN models and training datasets make deep leaning extremely computation- and memory-intensive. Recently, photonic computing has emerged as a promising technology for accelerating DNNs. While the design of photonic accelerators for DNN inference and forward propagation of DNN training has been widely investigated, the architectural acceleration for equally important backpropagation of DNN training has not been well studied. In this paper, we propose a novel silicon photonic-based backpropagation accelerator for high performance DNN training. Specifically, a general-purpose photonic gradient descent unit named STADIA is designed to implement the multiplication, accumulation, and subtraction operations required for computing gradients using mature optical devices including Mach-Zehnder Interferometer (MZI) and Mircoring Resonator (MRR), which can significantly reduce the training latency and improve the energy efficiency of backpropagation. To demonstrate efficient parallel computing, we propose a STADIA-based backpropagation acceleration architecture and design a dataflow by using wavelength-division multiplexing (WDM). We analyze the precision of STADIA by quantifying the precision limitations imposed by losses and noises. Furthermore, we evaluate STADIA with different element sizes by analyzing the power, area and time delay for photonic accelerators based on DNN models such as AlexNet, VGG19 and ResNet. Simulation results show that the proposed architecture STADIA can achieve significant improvement by 9.7× in time efficiency and 147.2× in energy efficiency, compared with the most advanced optical-memristor based backpropagation accelerator.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

STADIA: Photonic Stochastic Gradient Descent for Neural Network Accelerators

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Embedded Computing Systems

Lead the way for us

Similar Papers

A Framework for Distributed Deep Neural Network Training with Heterogeneous Computing Platforms
Bontak Gu ... Young Geun Kim
-
Bontak Gu, et. al.Bontak Gu ... Young Geun Kim
01 Dec 2019
01 Dec 2019

Neuroevolution in Deep Neural Networks: Current Trends and Future Challenges
Edgar Galvan ... Peter Mooney
IEEE Transactions on Artificial Intelligence | VOL. 2
Edgar Galvan, et. al.Edgar Galvan ... Peter Mooney
04 May 2021
IEEE Transactions on Artificial Intelligence | VOL. 2

PipePar: Enabling fast DNN pipeline parallel training in heterogeneous GPU clusters
Jinghui Zhang ... Zhiang Wu
Neurocomputing | VOL. 555
Jinghui Zhang, et. al.Jinghui Zhang ... Zhiang Wu
04 Aug 2023
Neurocomputing | VOL. 555

A Guessing Entropy-Based Framework for Deep Learning-Assisted Side-Channel Analysis
Ziyue Zhang ... Yunsi Fei
IEEE Transactions on Information Forensics and Security | VOL. 18
Ziyue Zhang, et. al.Ziyue Zhang ... Yunsi Fei
01 Jan 2023
IEEE Transactions on Information Forensics and Security | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

STADIA: Photonic Stochastic Gradient Descent for Neural Network Accelerators

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Embedded Computing Systems