A dual-memory architecture for reinforcement learning on neuromorphic platforms

Wilkie Olin-Ammentorp,Yury Sokolov,Maxim Bazhenov

doi:10.1088/2634-4386/ac1a64

Abstract

Reinforcement learning (RL) is a foundation of learning in biological systems and provides a framework to address numerous challenges with real-world artificial intelligence applications. Efficient implementations of RL techniques could allow for agents deployed in edge-use cases to gain novel abilities, such as improved navigation, understanding complex situations and critical decision making. Toward this goal, we describe a flexible architecture to carry out RL on neuromorphic platforms. This architecture was implemented using an Intel neuromorphic processor and demonstrated solving a variety of tasks using spiking dynamics. Our study proposes a usable solution for real-world RL applications and demonstrates applicability of the neuromorphic platforms for RL problems.

Highlights

As the number of data-collecting devices increases, so too does the need for efficient data processing
Dual-memory learner (DML) framework Monte Carlo (MC) methods provide well-characterized Reinforcement learning (RL) techniques for learning optimal policies via episodic experiences; the agent does not need to be equipped with a full model of how the environment will react to its actions in order to learn
It is crucial for neuromorphic systems to show that they are capable of RL techniques and can demonstrate advantages for these techniques over traditional hardware

Summary

Introduction

As the number of data-collecting devices increases, so too does the need for efficient data processing. Rather than require all data collected from remote devices be processed at a central location, the need for data processing to be performed in-situ is becoming a priority; this is especially true in situations where ‘agents’ collecting data may need to make critical decisions based on these inputs with low latency (such as in self-driving cars or aerial drones) For such use cases, efficiency of data processing becomes paramount, as energy sources and physical space (‘size, weight, and power’) come at a premium[1]. There is no universal definition on what constitutes a neuromorphic architecture, these systems generally aim to provide efficient, massively-parallel processing schemes which often use binary ‘spikes’ to transmit information[2]. We utilize Intel’s neuromorphic processor codenamed ‘Loihi.’[4]

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Neuromorphic Computing and Engineering	Publication Date: Sep 9, 2021
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

A dual-memory architecture for reinforcement learning on neuromorphic platforms

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Neuromorphic Computing and Engineering

Lead the way for us

Similar Papers

Learning and optimization in biological systems
D Teodorescu
Biological Cybernetics | VOL. 28
D TeodorescuD Teodorescu
01 Dec 1977
Biological Cybernetics | VOL. 28

Biped dynamic walking using reinforcement learning
Hamid Benbrahim ... Judy A Franklin
Robotics | VOL. 22
Hamid Benbrahim, et. al.Hamid Benbrahim ... Judy A Franklin
01 Dec 1997
Robotics | VOL. 22

Reinforcement Learning for Clinical Applications.
Kia Khezeli ... Benjamin Shickel
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18
Kia Khezeli, et. al.Kia Khezeli ... Benjamin Shickel
08 Feb 2023
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18

Interdisciplinary Approaches to Robot Learning
J Demiris ... A Birk
-
J Demiris, et. al.J Demiris ... A Birk
01 Jun 2000
01 Jun 2000

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A dual-memory architecture for reinforcement learning on neuromorphic platforms

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Neuromorphic Computing and Engineering