Abstract

Reinforcement learning (RL) using deep Q-networks (DQNs) has shown performance beyond the human level in a number of complex problems. In addition, many studies have focused on bio-inspired hardware-based spiking neural networks (SNNs) given the capabilities of these technologies to realize both parallel operation and low power consumption. Here, we propose an on-chip training method for DQNs applicable to hardware-based SNNs. Although the conventional backpropagation (BP) algorithm is approximated, a performance evaluation based on two simple games shows that the proposed system achieves performance similar to that of a software-based system. The proposed training method can minimize memory usage and reduce power consumption and area occupation. In particular, for simple problems, the memory requirement can be significantly reduced because high performance is achieved without using replay memory. Furthermore, we investigate the effect of the nonlinearity characteristics and two types of variation of non-ideal synaptic devices on the performance outcomes. In this work, thin-film transistor (TFT)-type flash memory cells are used as synaptic devices. A simulation is also conducted using a fully connected neural network with non-leaky integrate-and-fire (I&F) neurons. The proposed system shows strong immunity to device variations because an on-chip training scheme is adopted.
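To illustrate the replay-memory-free training regime mentioned above, the following is a minimal sketch of an online DQN-style temporal-difference update that learns from each transition immediately rather than sampling from a replay buffer. It is not the paper's hardware implementation: the linear Q-network, the update rule, and all hyperparameters (gamma, lr) are illustrative assumptions, and in the proposed system the gradient step would instead be realized by the approximated, device-friendly weight-update rule applied to the synaptic array.

```python
# Minimal sketch (assumed names and hyperparameters) of online DQN training
# without an experience-replay buffer: each transition updates the Q-network
# immediately.
import numpy as np

class LinearQNet:
    """Single-layer Q-network: Q(s, .) = W s + b."""
    def __init__(self, n_states, n_actions, lr=0.01):
        self.W = np.zeros((n_actions, n_states))
        self.b = np.zeros(n_actions)
        self.lr = lr

    def q_values(self, s):
        return self.W @ s + self.b

    def update(self, s, a, td_error):
        # Gradient step on 0.5 * td_error^2 w.r.t. the chosen action's weights.
        self.W[a] += self.lr * td_error * s
        self.b[a] += self.lr * td_error

def online_dqn_step(net, s, a, r, s_next, done, gamma=0.99):
    """One temporal-difference update on a single transition (no replay memory)."""
    target = r if done else r + gamma * np.max(net.q_values(s_next))
    td_error = target - net.q_values(s)[a]
    net.update(s, a, td_error)
    return td_error
```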

