Task Learning Over Multi-Day Recording via Internally Rewarded Reinforcement Learning Based Brain Machine Interfaces.

Xiang Shen, Xiang Zhang, Yifan Huang, Shuhang Chen, Yiwen Wang

doi:10.1109/tnsre.2020.3039970

Abstract

Autonomous brain machine interfaces (BMIs) aim to enable paralyzed people to self-evaluate their movement intention to control external devices. Previous reinforcement learning (RL)-based decoders map neural activity to movements using an external reward in well-trained subjects, and have not investigated the task learning process. The brain has developed a learning mechanism to identify the correct actions that lead to rewards in a new task. This internal guidance can replace the external reference to advance BMIs toward autonomous systems. In this study, we propose an internally rewarded RL-based BMI framework that uses multi-site recordings to demonstrate the autonomous learning ability of the BMI decoder on a new task. We test the model on neural data collected over multiple days while rats were learning a new lever discrimination task. The proposed RL framework decodes spikes from the primary motor cortex (M1) and medial prefrontal cortex (mPFC) into discrete lever press actions. The mPFC activity following the action is interpreted as internal reward information: a support vector machine classifies rewarded vs. non-rewarded trials with an accuracy of 87.5% across subjects. This internal reward replaces the external water reward to update the decoder, which is able to adapt to the nonstationary neural activity during subject learning. The multi-cortical recording provides more cortical signals as decoder input and supplies an internal critic to guide decoder learning. Compared with the classic decoder that uses M1 activity as the only input and relies on external guidance, the proposed system with multi-cortical recordings shows better decoding accuracy.
More importantly, our internally rewarded decoder demonstrates autonomous learning ability on the new task: it successfully tracks the time-variant neural patterns while the subjects are learning, and its performance improves asymptotically as the subjects' behavioral learning progresses. This reveals the potential of endowing BMIs with autonomous task learning ability in the RL framework.
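
The idea of updating a decoder from an internal reward rather than an external one can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the decoder's update rule, so the code below assumes a softmax policy with a REINFORCE-style update. The neural features are simulated, and the mPFC reward classifier (an SVM in the study) is replaced by a hypothetical noisy critic that reports the true trial outcome with probability 0.875, chosen to mirror the reported classification accuracy. All function names and parameters are illustrative assumptions.

```python
import math
import random


def softmax(zs):
    """Numerically stable softmax over a list of scores."""
    m = max(zs)
    exps = [math.exp(z - m) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]


def simulate_trial(rng, n_inputs):
    """Hypothetical data: a target lever (0 or 1) and noisy firing-rate features.

    Assumption: even-indexed units prefer lever 0, odd-indexed units lever 1.
    """
    target = rng.randrange(2)
    x = [rng.gauss(1.0 if (i % 2) == target else 0.0, 0.5)
         for i in range(n_inputs)]
    return target, x


def internal_reward(rng, action, target, critic_accuracy):
    """Stand-in for the mPFC reward classifier (not an SVM).

    Returns the true outcome with probability critic_accuracy,
    otherwise the flipped label, encoded as +1 / -1.
    """
    correct = (action == target)
    if rng.random() >= critic_accuracy:
        correct = not correct
    return 1.0 if correct else -1.0


def train(n_trials=3000, n_inputs=8, lr=0.05, critic_accuracy=0.875, seed=0):
    """Train a two-action softmax decoder using only the noisy internal reward."""
    rng = random.Random(seed)
    # One weight vector (plus bias) per lever-press action.
    w = [[0.0] * (n_inputs + 1) for _ in range(2)]
    hits = []
    for _ in range(n_trials):
        target, x = simulate_trial(rng, n_inputs)
        feats = x + [1.0]  # append bias input
        scores = [sum(wi * fi for wi, fi in zip(w[a], feats)) for a in range(2)]
        probs = softmax(scores)
        action = 0 if rng.random() < probs[0] else 1
        r = internal_reward(rng, action, target, critic_accuracy)
        # REINFORCE-style step: grad of log pi(action | x) w.r.t. w[k]
        # is (1[k == action] - probs[k]) * feats.
        for k in range(2):
            g = (1.0 if k == action else 0.0) - probs[k]
            for j in range(n_inputs + 1):
                w[k][j] += lr * r * g * feats[j]
        hits.append(action == target)  # true outcome, used for evaluation only
    return w, hits


if __name__ == "__main__":
    _, hits = train()
    print("accuracy over first 100 trials:", sum(hits[:100]) / 100)
    print("accuracy over last 500 trials:", sum(hits[-500:]) / 500)
```

In this sketch the decoder never sees the true outcome during training; `hits` is recorded only to evaluate it afterwards. Because the simulated critic is right more often than wrong, the expected update still points in the correct direction, which is the intuition behind replacing the external reward with an imperfect internal one.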


Journal: IEEE Transactions on Neural Systems and Rehabilitation Engineering: a publication of the IEEE Engineering in Medicine and Biology Society
Publication Date: Dec 1, 2020
Citations: 40

Similar Papers

Spike prediction on primary motor cortex from medial prefrontal cortex during task learning
Shenghui Wu ... Yifan Huang
Journal of Neural Engineering | VOL. 19
29 Jul 2022

Editor's evaluation: Activity in developing prefrontal cortex is shaped by sleep and sensory experience
Muriel Thoby-Brisson
12 Oct 2022

Decision letter: Activity in developing prefrontal cortex is shaped by sleep and sensory experience
Anita Lüthi ... Laura L Colgin
12 Oct 2022

Author response: Activity in developing prefrontal cortex is shaped by sleep and sensory experience
Lex J Gómez ... Mark S Blumberg
16 Dec 2022
