Abstract

Deep neural networks (DNNs) have substantial computational requirements, which greatly limit their performance in resource-constrained environments. Recently, there have been increasing efforts on optical neural networks and optical-computing-based DNN hardware, which bring significant advantages for deep learning systems in power efficiency, parallelism, and computational speed. Among them, free-space diffractive deep neural networks (D2NNs), which are based on light diffraction, feature millions of neurons in each layer interconnected with neurons in neighboring layers. However, due to the challenge of implementing reconfigurability, deploying a different DNN algorithm requires re-building and duplicating the physical diffractive system, which significantly degrades hardware efficiency in practical application scenarios. Thus, this work proposes a novel hardware-software co-design method that enables first-of-its-kind real-time multi-task learning in D2NNs, automatically recognizing which task is being deployed in real time. Our experimental results demonstrate significant improvements in versatility and hardware efficiency, and also demonstrate and quantify the robustness of the proposed multi-task D2NN architecture under wide noise ranges of all system components. In addition, we propose a domain-specific regularization algorithm for training the proposed multi-task architecture, which can be used to flexibly adjust the desired performance for each task.

Highlights

  • Deep neural networks (DNNs) have substantial computational requirements, which greatly limit their performance in resource-constrained environments

  • The past half-decade has seen unprecedented growth in machine learning with deep neural networks (DNNs)

  • Even with the recent progress of integrated analog signal processors in accelerating DNN systems by focusing on matrix multiplication, such as the Vector Matrix Multiplying module (VMM) [5], the mixed-mode Multiplying-Accumulating unit (MAC) [6-8], and resistive random access memory (RRAM) based MAC [9-13], parallelization is still highly limited


Introduction

Deep neural networks (DNNs) have substantial computational requirements, which greatly limit their performance in resource-constrained environments. Similar to conventional DNNs, the final output class is predicted by generating labels according to a given one-hot representation, e.g., by taking the max operation over the output signals of the last diffractive layer observed by the detectors.
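The readout described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the detector intensity values below are assumed example numbers, and each detector region is taken to correspond to one output class.

```python
import numpy as np

# Hypothetical readout: one integrated intensity value per detector region
# at the output plane of the last diffractive layer (assumed example values).
detector_signals = np.array([0.12, 0.85, 0.03, 0.41])

# As with a one-hot label representation, the predicted class is the index
# of the detector region with the maximum observed intensity.
predicted_class = int(np.argmax(detector_signals))

# One-hot vector of the prediction, for comparison against the target label.
one_hot = np.zeros_like(detector_signals)
one_hot[predicted_class] = 1.0

print(predicted_class)  # → 1
```

In a physical D2NN the "argmax" happens implicitly: whichever detector collects the most light determines the class, so no electronic post-processing beyond comparing detector readings is needed.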

