Abstract
The stochastic multi-armed bandit problem is a standard model for the exploration–exploitation trade-off in sequential decision problems. In clinical trials, which are sensitive to outlier data, the goal is to learn a risk-averse policy that balances exploration, exploitation, and safety. In this paper, we present a risk-averse multi-armed bandit algorithm for a decision-making problem based on the social engagement behaviors of children with Autism Spectrum Disorder (ASD). The algorithm is applied while children interact with a humanoid robot and imitate a sequence of the robot's movements. The proposed algorithm builds on the Best Empirical Sampled Average algorithm, with Entropic Value-at-Risk as the risk measure, to select the sequence of movements that best improves the social engagement behaviors of children with ASD while they imitate the robot's movements. We provide a detailed experimental analysis comparing the performance of our proposed algorithm with several well-known risk-averse multi-armed bandit algorithms on synthetic scenarios and on our real-world problem. The experimental results show that the proposed algorithm outperforms its competitors in terms of robustness, risk avoidance, and cumulative regret, promoting the social engagement behaviors of children with ASD when imitating a robot's movements.
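For context, Entropic Value-at-Risk is conventionally defined through the moment-generating function of the loss; the following is the standard textbook definition, not a formula reproduced from this paper. For a loss random variable $X$ with finite exponential moments and confidence level $1-\alpha$,
$$
\mathrm{EVaR}_{1-\alpha}(X) \;=\; \inf_{z > 0} \left\{ \frac{1}{z} \ln\!\left( \frac{\mathbb{E}\!\left[e^{zX}\right]}{\alpha} \right) \right\},
$$
which upper-bounds both Value-at-Risk and Conditional Value-at-Risk at the same confidence level, making it a comparatively conservative choice of risk measure for safety-sensitive arm selection.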