Adaptive reinforcement learning with active state-specific exploration for engagement maximization during simulated child-robot interaction

George Velentzas,Theodore Tsitsimis,Costas Tzafestas,Mehdi Khamassi,Iñaki Rañó

doi:10.1515/pjbr-2018-0016

Abstract

AbstractUsing assistive robots for educational applications requires robots to be able to adapt their behavior specifically for each child with whom they interact.Among relevant signals, non-verbal cues such as the child’s gaze can provide the robot with important information about the child’s current engagement in the task, and whether the robot should continue its current behavior or not. Here we propose a reinforcement learning algorithm extended with active state-specific exploration and show its applicability to child engagement maximization as well as more classical tasks such as maze navigation. We first demonstrate its adaptive nature on a continuous maze problem as an enhancement of the classic grid world. There, parameterized actions enable the agent to learn single moves until the end of a corridor, similarly to “options” but without explicit hierarchical representations.We then apply the algorithm to a series of simulated scenarios, such as an extended Tower of Hanoi where the robot should find the appropriate speed of movement for the interacting child, and to a pointing task where the robot should find the child-specific appropriate level of expressivity of action. We show that the algorithm enables to cope with both global and local non-stationarities in the state space while preserving a stable behavior in other stationary portions of the state space. Altogether, these results suggest a promising way to enable robot learning based on non-verbal cues and the high degree of non-stationarities that can occur during interaction with children.

Highlights

Using assistive robots for educational applications requires robots to be able to adapt their behavior for each child with whom they interact
In [15, 16], we have previously proposed to apply the framework of Parameterized Action Space Markov Decision Processes (PAMDP) [17, 18] to human-robot interaction because it constitutes a promising intermediate solution between discrete and continuous action learning
This paradigm follows the objectives defined in the framework of the EU-funded project BabyRobot (H2020-ICT-24-20156878310), where a set of child-robot interaction use-cases have been designed and implemented to study the development of specific socio-affective, communication and collaborative skills in typical and Autistic Spectrum Disorders (ASD) children

Summary

Introduction

Abstract: Using assistive robots for educational applications requires robots to be able to adapt their behavior for each child with whom they interact. We propose a reinforcement learning algorithm extended with active state-specific exploration and show its applicability to child engagement maximization as well as more classical tasks such as maze navigation. We first demonstrate its adaptive nature on a continuous maze problem as an enhancement of the classic grid world. We apply the algorithm to a series of simulated scenarios, such as an extended Tower of Hanoi where the robot should find the appropriate speed of movement for the interacting child, and to a pointing task where the robot should find the child-specific appropriate level of expressivity of action

Objectives

Methods

Results

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Paladyn, Journal of Behavioral Robotics	Publication Date: Aug 1, 2018
Citations: 8	License type: CC BY-NC-ND 4.0

R Discovery Prime

R Discovery Prime

Adaptive reinforcement learning with active state-specific exploration for engagement maximization during simulated child-robot interaction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Paladyn, Journal of Behavioral Robotics

Lead the way for us

Similar Papers

Adaptive Multiagent Reinforcement Learning with Non-positive Regret
Duong D Nguyen ... Hung X Nguyen
-
Duong D Nguyen, et. al.Duong D Nguyen ... Hung X Nguyen
01 Jan 2015
01 Jan 2015

Robot Fast Adaptation to Changes in Human Engagement During Simulated Dynamic Social Interaction With Active Exploration in Parameterized Reinforcement Learning
Mehdi Khamassi ... Costas Tzafestas
IEEE Transactions on Cognitive and Developmental Systems | VOL. 10
Mehdi Khamassi, et. al.Mehdi Khamassi ... Costas Tzafestas
01 Dec 2018
IEEE Transactions on Cognitive and Developmental Systems | VOL. 10

Balancing Value Iteration and Policy Iteration for Discrete-Time Control
Biao Luo ... Huai-Ning Wu
IEEE Transactions on Systems, Man, and Cybernetics: Systems | VOL. 50
Biao Luo, et. al.Biao Luo ... Huai-Ning Wu
11 Mar 2019
IEEE Transactions on Systems, Man, and Cybernetics: Systems | VOL. 50

Elastic management of cloud applications using adaptive reinforcement learning
Konstantinos Lolos ... Nectarios Koziris
-
Konstantinos Lolos, et. al.Konstantinos Lolos ... Nectarios Koziris
01 Dec 2017
01 Dec 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Adaptive reinforcement learning with active state-specific exploration for engagement maximization during simulated child-robot interaction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Paladyn, Journal of Behavioral Robotics