Abstract
Objective. One of the recent developments in the field of brain–computer interfaces (BCI) is the reinforcement learning (RL) based BCI paradigm, which uses neural error responses as the reward feedback on the agent's action. While RL-BCI has several advantages over motor-imagery-based BCI, its reliability depends critically on the decoding accuracy of noisy neural error signals. A principled method is needed to optimally handle this inherent noise under general conditions.

Approach. By determining a trade-off between the expected value and the informational cost of policies, the info-RL (IRL) algorithm provides optimal low-complexity policies, which are robust under noisy reward conditions and achieve the maximal obtainable value. In this work we utilize the IRL algorithm to characterize the maximal obtainable value under different noise levels, which in turn is used to extract the optimal robust policy for each noise level.

Main results. Our simulation results for a setting with Gaussian noise show that the complexity level of the optimal policy depends on the reward magnitude but not on the reward variance, whereas the variance determines whether a lower-complexity solution is favorable. We show how this analysis can be utilized to select optimal robust policies for an RL-BCI and demonstrate its use on EEG data.

Significance. We propose a principled method to determine the optimal policy complexity of an RL problem with a noisy reward, which we argue is particularly useful for RL-based BCI paradigms. This framework may be used to minimize initial training time and to allow a more dynamic and robust shared control between the agent and the operator under different conditions.
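The value–information trade-off underlying the abstract can be illustrated with a minimal sketch. The following is an illustrative assumption, not the paper's implementation: for each state, the policy maximizing expected value minus a KL-divergence cost to a prior takes a softmax form, with an inverse-temperature parameter (here called `beta`) controlling policy complexity. All function and variable names, and the toy Q-values, are hypothetical.

```python
import numpy as np

def info_rl_policy(Q, beta, prior=None):
    """Softmax policy trading expected value against informational cost.

    For each state s, the policy maximizing
        E_pi[Q(s, a)] - (1 / beta) * KL(pi(.|s) || prior)
    is pi(a|s) proportional to prior(a) * exp(beta * Q(s, a)).
    Small beta -> low-complexity policy close to the prior, robust to
    reward noise; large beta -> near-greedy, high-complexity policy.
    """
    Q = np.asarray(Q, dtype=float)              # shape (n_states, n_actions)
    if prior is None:
        prior = np.full(Q.shape[1], 1.0 / Q.shape[1])  # uniform prior
    logits = beta * Q + np.log(prior)           # unnormalized log-policy
    logits -= logits.max(axis=1, keepdims=True) # numerical stability
    pi = np.exp(logits)
    return pi / pi.sum(axis=1, keepdims=True)

# Toy example: two states, two actions, estimated (possibly noisy) Q-values.
Q = np.array([[1.0, 0.2],
              [0.3, 0.9]])
low_complexity = info_rl_policy(Q, beta=0.5)    # stays close to uniform
high_complexity = info_rl_policy(Q, beta=20.0)  # nearly deterministic
```

Under this reading, sweeping `beta` traces out the value–complexity curve described in the abstract: a noisier reward channel favors a smaller `beta`, i.e. a lower-complexity, more robust policy.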