Behavior monitoring under uncertainty using Bayesian surprise and optimal action selection

Luis Avila, Ernesto Martínez

doi:10.1016/j.eswa.2014.04.031

Abstract

The increasing trend towards delegating tasks to autonomous artificial agents in safety-critical socio-technical systems makes monitoring an action selection policy of paramount importance. Agent behavior monitoring may profit from a stochastic specification of an optimal policy under uncertainty. A probabilistic monitoring approach is proposed to assess whether an agent's behavior (or policy) respects its specification. The desired policy is modeled by a prior distribution for state transitions in an optimally controlled stochastic process. Bayesian surprise is defined as the Kullback–Leibler divergence between the state transition distribution for the observed behavior and the distribution for optimal action selection. Twin Gaussian processes are used to provide a sensitive on-line estimate of Bayesian surprise from small samples. Timely detection of deviant behavior or anomalies in an artificial pancreas highlights the sensitivity of Bayesian surprise to meaningful discrepancies from the stochastic optimal policy in the presence of excessive glycemic variability, sensor errors, controller ill-tuning, and infusion pump malfunction. To reject outliers and leave out redundant information, on-line sparsification of data streams is proposed.
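The definition of Bayesian surprise as a Kullback–Leibler divergence can be illustrated with a minimal sketch. It is not the paper's estimator: the abstract states that twin Gaussian processes are used, whereas this example assumes, purely for illustration, that both next-state distributions are univariate Gaussians with known mean and variance, so the divergence has a closed form. The function names, the direction of the divergence (observed relative to optimal), and the numeric values are all assumptions.

```python
import math


def gaussian_kl(mu_p, var_p, mu_q, var_q):
    """Closed-form KL(P || Q) for univariate Gaussians P = N(mu_p, var_p), Q = N(mu_q, var_q)."""
    if var_p <= 0 or var_q <= 0:
        raise ValueError("variances must be positive")
    return 0.5 * (math.log(var_q / var_p) + (var_p + (mu_p - mu_q) ** 2) / var_q - 1.0)


def bayesian_surprise(observed, optimal):
    """Surprise of the observed next-state distribution relative to the optimal-policy one.

    Both arguments are (mean, variance) tuples. Taking the observed distribution
    as the first KL argument is an assumption of this sketch.
    """
    return gaussian_kl(observed[0], observed[1], optimal[0], optimal[1])


# Hypothetical next-state distributions (illustrative numbers only).
optimal = (100.0, 25.0)   # transitions expected under the optimal policy
nominal = (100.0, 25.0)   # observed behavior that matches the specification
deviant = (130.0, 100.0)  # observed behavior with shifted mean and larger spread

print("nominal surprise:", bayesian_surprise(nominal, optimal))
print("deviant surprise:", bayesian_surprise(deviant, optimal))
```

Under these assumptions, behavior that matches the specification yields zero surprise, and the surprise grows as the observed transition distribution drifts away from the optimal one. A monitor could compare that value against a threshold.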

Journal: Expert Systems With Applications
Publication Date: May 6, 2014
Citations: 6
License type: cc-by-nc-nd

Similar Papers

Salient Object Detection based on Bayesian Surprise of Restricted Boltzmann Machine
Rahul Roy ... Susmita Ghosh
18 Dec 2018

S154. THE ROLE OF DOPAMINE IN PROCESSING THE MEANINGFUL INFORMATION OF OBSERVATIONS, AND IMPLICATIONS FOR THE ABERRANT SALIENCE HYPOTHESIS OF SCHIZOPHRENIA
...
Schizophrenia Bulletin | VOL. 44
01 Apr 2018

Roles for globus pallidus externa revealed in a computational model of action selection in the basal ganglia
Shreyas M Suryanarayana ... Kevin N Gurney
Neural Networks | VOL. 109
19 Oct 2018

Control system performance monitoring based on optimal action selection
Luis Ávila ... Ernesto Martínez
Computer Aided Chemical Engineering | VOL. 30
01 Jan 2012
