Lifelong Personalization via Gaussian Process Modeling for Long-Term HRI.

Samuel Spaulding,Hae Won Park,Jocelyn Shen,Cynthia Breazeal

doi:10.3389/frobt.2021.683066

Samuel Spaulding, Hae Won Park + Show 2 more

Open Access

https://doi.org/10.3389/frobt.2021.683066

Copy DOI

Abstract

Across a wide variety of domains, artificial agents that can adapt and personalize to users have potential to improve and transform how social services are provided. Because of the need for personalized interaction data to drive this process, long-term (or longitudinal) interactions between users and agents, which unfold over a series of distinct interaction sessions, have attracted substantial research interest. In recognition of the expanded scope and structure of a long-term interaction, researchers are also adjusting the personalization models and algorithms used, orienting toward “continual learning” methods, which do not assume a stationary modeling target and explicitly account for the temporal context of training data. In parallel, researchers have also studied the effect of “multitask personalization,” an approach in which an agent interacts with users over multiple different tasks contexts throughout the course of a long-term interaction and learns personalized models of a user that are transferrable across these tasks. In this paper, we unite these two paradigms under the framework of “Lifelong Personalization,” analyzing the effect of multitask personalization applied to dynamic, non-stationary targets. We extend the multi-task personalization approach to the more complex and realistic scenario of modeling dynamic learners over time, focusing in particular on interactive scenarios in which the modeling agent plays an active role in teaching the student whose knowledge the agent is simultaneously attempting to model. Inspired by the way in which agents use active learning to select new training data based on domain context, we augment a Gaussian Process-based multitask personalization model with a mechanism to actively and continually manage its own training data, allowing a modeling agent to remove or reduce the weight of observed data from its training set, based on interactive context cues. We evaluate this method in a series of simulation experiments comparing different approaches to continual and multitask learning on simulated student data. We expect this method to substantially improve learning in Gaussian Process models in dynamic domains, establishing Gaussian Processes as another flexible modeling tool for Long-term Human-Robot Interaction (HRI) Studies.

Highlights

Our goal is to develop adaptive robotic agents that are deeply personalized and designed for long-term interaction
Each of these paradigm shifts in interaction design are mirrored by an associated paradigm shift in algorithm and model design: continual learning, which accounts for the temporal sequence in which data is received and assumes a dynamic or non-stationary modeling target, and transfer learning which accounts for the task in which training data originated and uses data from one “source” task to more quickly learn a model in a different “target” task
To improve the ability of the Gaussian Process (GP)-based STUDENTMODELs to handle lifelong personalization, we introduce a continual active training data management (CATDaM) mechanism that allows the tutoring agent to proactively manage its training data through a novel two-way active learning protocol, enabling the model to both select new data points to add to its training set and automatically prune its existing training set to remove “stale” data points that may no longer be a good representation of the dynamic student

Summary

INTRODUCTION

Our goal is to develop adaptive robotic agents that are deeply personalized and designed for long-term interaction. In prior work (Spaulding et al, 2021) we laid out the theoretical benefits of a multi-task personalization paradigm and evaluated the combined-task proficiency and data efficiency of the approach in models trained to estimate simulated student mastery in two different game tasks These games, called RHYMERACER and WORDBUILDER, were developed in partnership with experts in children’s media and early literacy learning, and were designed to help young students practice different literacy skills, namely rhyming and spelling. The authors highlight a number of important shifts in viewpoint when adopting this approachrecognizing that human affective response is idiosyncratic (i.e., personalized), dynamic (i.e., changes over time), and contextual (i.e., changes with task or environment) We argue that these same qualities apply more broadly, to many aspects of human interactive behavior, though in this paper we primarily focus on student learning in educational interactions. A modeling approach that acknowledges and accounts for these qualities may be the key to successful, personalized long-term interactions

Lifelong Personalization-Personalized Modeling Across Tasks and Over Time

Research Contributions

Summary of Approach

Perspectives on Lifelong Personalization

PERSONALIZED LITERACY GAME SYSTEM

RhymeRacer

WordBuilder

Strategy and Content Models

Gaussian Processes in Word Space

Transferrable Gaussian Processes

SIMULATED STUDENTS

Simulating Student Performance Data

Dynamic Students

EVALUATING LIFELONG GAUSSIAN PROCESSES AND MULTITASK TRANSFER IN SIMULATION

Multitask Personalization With Stationary Students

Lifelong Personalization With Dynamic Students

Findings

CONCLUSION AND FURTHER DISCUSSION

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Robotics and AI	Publication Date: Jun 7, 2021
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Lifelong Personalization via Gaussian Process Modeling for Long-Term HRI.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Robotics and AI

Lead the way for us

Similar Papers

Is Multi-Task Learning an Upper Bound for Continual Learning?
Zihao Wu ... Soheil Kolouri
-
Zihao Wu, et. al.Zihao Wu ... Soheil Kolouri
04 Jun 2023
04 Jun 2023

Music genre classification using Gaussian Process models
Konstantin Markov ... Tomoko Matsui
-
Konstantin Markov, et. al.Konstantin Markov ... Tomoko Matsui
01 Sep 2013
01 Sep 2013

A Human-Centered View of Continual Learning: Understanding Interactions, Teaching Patterns, and Perceptions of Human Users Towards a Continual Learning Robot in Repeated Interactions
Ali Ayub ... Jainish Mehta
ACM transactions on human-robot interaction | VOL. -
Ali Ayub, et. al.Ali Ayub ... Jainish Mehta
23 May 2024
ACM transactions on human-robot interaction | VOL. -

Targeted Data Poisoning Attacks Against Continual Learning Neural Networks
Huayu Li ... Gregory Ditzler
-
Huayu Li, et. al.Huayu Li ... Gregory Ditzler
18 Jul 2022
18 Jul 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Lifelong Personalization via Gaussian Process Modeling for Long-Term HRI.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Robotics and AI