Abstract

Reinforcement learning (RL), and Q-learning in particular, offers human-like learning from experience without requiring any a priori data and is increasingly used in embedded systems for control and navigation. However, finding the optimal policy in this approach can be highly compute-intensive, and a software-only implementation may not satisfy the application's timing constraints. To this end, we propose optimization methods at multiple levels of accelerator design for RL. Specifically, at the architecture level, we exploit instruction-level parallelism and the spatial parallelism of FPGAs to improve throughput over state-of-the-art designs by up to 34%. Further, we propose lookup table-level optimizations to reduce the resource utilization and power dissipation of the accelerator. Finally, we propose algorithm-level approximation for accelerating Q-learning problems with larger state spaces and for reducing peak power dissipation. We report up to 10× reduction in power dissipation with marginal degradation in quality of results.
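
For context, the core kernel that such accelerators target is the tabular Q-learning update. The following is a minimal C sketch of that update; the table sizes and learning parameters (N_STATES, N_ACTIONS, alpha, gamma) are illustrative assumptions and not values from this work.

```c
/* Minimal tabular Q-learning update -- a sketch of the kernel such an
 * accelerator implements. Sizes and hyperparameters below are
 * illustrative assumptions, not values from the paper. */
#include <stddef.h>

#define N_STATES  64   /* assumed number of states  */
#define N_ACTIONS 4    /* assumed number of actions */

static float Q[N_STATES][N_ACTIONS];

/* One Q-learning step:
 * Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a)) */
void q_update(size_t s, size_t a, float r, size_t s_next,
              float alpha, float gamma)
{
    /* Find the best Q-value of the successor state. */
    float max_next = Q[s_next][0];
    for (size_t i = 1; i < N_ACTIONS; ++i) {
        if (Q[s_next][i] > max_next)
            max_next = Q[s_next][i];
    }
    /* Apply the temporal-difference update to the Q-table entry. */
    Q[s][a] += alpha * (r + gamma * max_next - Q[s][a]);
}
```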
