Abstract

In order to be acceptable and able to “camouflage” into their physio-social context in the long run, robots need to be not just functional, but autonomously psycho-affective as well. This motivates the long-term necessity of introducing behavioral autonomy in robots, so that they can communicate with humans autonomously, without the need for “wizard” intervention. This paper proposes a technique to learn robot speech models from human-robot dialog exchanges. It views the entire exchange in the Automated Planning (AP) paradigm, representing the dialog sequences (speech acts) as action sequences that modify the state of the world upon execution, gradually propelling the state towards a desired goal. We then exploit intra-action and inter-action dependencies, encoding them in the form of constraints. We attempt to satisfy these constraints using a weighted maximum satisfiability model (MAX-SAT), and convert the solution into a speech model. This model has many potential uses, such as the planning of fresh dialogs. In this study, the learnt model is used to predict speech acts in the dialog sequences using the sequence-labeling (predicting future acts based on previously seen ones) capabilities of the Long Short-Term Memory (LSTM) class of recurrent neural networks. Encouraging empirical results demonstrate the utility of this learnt model and its long-term potential to facilitate autonomous behavioral planning of robots, an aspect to be explored in future work.
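To make the constraint-satisfaction step concrete, the sketch below shows how one intra-operator and one inter-operator constraint might be encoded as a weighted MAX-SAT instance. It is a minimal, hypothetical illustration using the open-source PySAT library and its RC2 solver; the paper does not name a specific solver, and the variable meanings, weights, and constraints below are assumed for illustration rather than taken from the paper's actual encoding.

    # A minimal MAX-SAT sketch (assumed encoding, not the paper's) using PySAT.
    from pysat.formula import WCNF
    from pysat.examples.rc2 import RC2

    # Boolean variables: 1 <-> "greeted is an ADD effect of act Greet",
    #                    2 <-> "greeted is a PRECONDITION of act AskTask",
    #                    3 <-> "greeted is a PRECONDITION of act Greet".
    ADD_GREET, PRE_ASK, PRE_GREET = 1, 2, 3

    wcnf = WCNF()
    # Intra-operator constraint (hard): a fact added by an operator should
    # not also be one of its preconditions: NOT(ADD_GREET) OR NOT(PRE_GREET).
    wcnf.append([-ADD_GREET, -PRE_GREET])
    # Inter-operator constraint (soft, weighted by how often AskTask follows
    # Greet in the corpus): Greet should supply AskTask's precondition.
    wcnf.append([ADD_GREET], weight=12)
    wcnf.append([PRE_ASK], weight=12)

    with RC2(wcnf) as solver:
        model = solver.compute()  # e.g. [1, 2, -3]
        # Positive literals in the optimal model become the learnt
        # preconditions and effects of the speech-act operators.
        print(model)

In this toy instance the solver keeps both soft clauses and satisfies the hard clause, yielding a model in which Greet adds the fact that AskTask requires.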

Highlights

  • In order to be acceptable and able to “camouflage” into their physio-social context in the long run, robots need to be not just functional, but autonomously psycho-affective as well

  • This paper proposes a technique to learn robot speech models from human-robot dialog exchanges

  • It views the entire exchange in the Automated Planning (AP) paradigm, representing the dialog sequences as action sequences that modify the state of the world upon execution, gradually propelling the state towards a desired goal

Summary

Introduction

In order to be acceptable and able to “camouflage” into their physio-social context in the long run, robots need to be not just functional, but autonomously psycho-affective as well. This paper proposes a technique to learn robot speech models from human-robot dialog exchanges. Scripting these subtleties is time-consuming, impractical, and requires a lot of programming. In such a complex scenario, learning the underlying speech model of the robot from human-robot interaction (HRI) dialog sequences is a promising alternative. Learning the underlying model comprising action descriptions from action name-signature sequences saves the effort of having to code these action descriptions from scratch, promoting reusability. This model can further be fed to an automated planner to generate fresh dialogs, allowing the robot to communicate autonomously in future scenarios. Our contribution in this paper is the following: given an HRI dialog corpus, our approach learns the robot’s behavioral model, comprising the utterances encoded in the form of actions along with their signatures, preconditions, and effects.
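The sketch below illustrates what such a learnt operator could look like: an action name and signature paired with the preconditions and effects recovered from the constraint solution. The class and predicate names are illustrative assumptions, not the paper's actual schema.

    # An illustrative speech-act operator (assumed representation).
    from dataclasses import dataclass, field

    @dataclass
    class SpeechActOperator:
        name: str                   # e.g. "greet"
        signature: tuple[str, ...]  # typed parameters, e.g. ("?r - robot", "?h - human")
        preconditions: set[str] = field(default_factory=set)
        add_effects: set[str] = field(default_factory=set)
        del_effects: set[str] = field(default_factory=set)

        def applicable(self, state: set[str]) -> bool:
            """The act can fire only when all its preconditions hold."""
            return self.preconditions <= state

        def apply(self, state: set[str]) -> set[str]:
            """Executing the act moves the dialog state toward the goal."""
            return (state - self.del_effects) | self.add_effects

    # Usage: a greeting is applicable in the empty state and makes
    # "greeted(?r, ?h)" true, which a later "ask-task" act may require.
    greet = SpeechActOperator("greet", ("?r - robot", "?h - human"),
                              add_effects={"greeted(?r, ?h)"})
    print(greet.apply(set()))  # {'greeted(?r, ?h)'}

Operators in this form are exactly what an automated planner consumes, which is how the learnt model could later support generating fresh dialogs.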

Related work
Definitions and problem formulation
Approach
Annotation and generalization
Intra-operator constraints
Inter-operator constraints
Data encoding for labelling of operator sequences
Evaluation
LSTM based sequence labeling
Conclusion