Multimodal Dialogue Research Articles

The RoboHelper project has the goal of developing assistive robots for the elderly. One crucial component of such a robot is a multimodal dialogue architecture, since collaborative task-oriented human–human dialogue is inherently multimodal. In this paper, we focus on a specific type of interaction, Haptic-Ostensive (H-O) actions, that are pervasive in collaborative dialogue. H-O actions manipulate objects, but they also often perform a referring function.We collected 20 collaborative task-oriented human–human dialogues between a helper and an elderly person in a realistic setting. To collect the haptic signals, we developed an unobtrusive sensory glove with pressure sensors. Multiple annotations were then conducted to build the Find corpus. Supervised machine learning was applied to these annotations in order to develop reference resolution and dialogue act classification modules. Both corpus analysis, and these two modules show that H-O actions play a crucial role in interaction: models that include H-O actions, and other extra-linguistic information such as pointing gestures, perform better.For true human–robot interaction, all communicative intentions must of course be recognized in real time, not on the basis of annotated categories. To demonstrate that our corpus analysis is not an end in itself, but can inform actual human–robot interaction, the last part of our paper presents additional experiments on recognizing H-O actions from the haptic signals measured through the sensory glove. We show that even though pressure sensors are relatively imprecise and the data provided by the glove is noisy, the classification algorithms can successfully identify actions of interest within subjects.

Read full abstract

In face-to-face conversations, speakers are continuously checking whether the listener is engaged in the conversation, and they change their conversational strategy if the listener is not fully engaged. With the goal of building a conversational agent that can adaptively control conversations, in this study we analyze listener gaze behaviors and develop a method for estimating whether a listener is engaged in the conversation on the basis of these behaviors. First, we conduct a Wizard-of-Oz study to collect information on a user's gaze behaviors. We then investigate how conversational disengagement, as annotated by human judges, correlates with gaze transition, mutual gaze (eye contact) occurrence, gaze duration, and eye movement distance. On the basis of the results of these analyses, we identify useful information for estimating a user's disengagement and establish an engagement estimation method using a decision tree technique. The results of these analyses show that a model using the features of gaze transition, mutual gaze occurrence, gaze duration, and eye movement distance provides the best performance and can estimate the user's conversational engagement accurately. The estimation model is then implemented as a real-time disengagement judgment mechanism and incorporated into a multimodal dialog manager in an animated conversational agent. This agent is designed to estimate the user's conversational engagement and generate probing questions when the user is distracted from the conversation. Finally, we evaluate the engagement-sensitive agent and find that asking probing questions at the proper times has the expected effects on the user's verbal/nonverbal behaviors during communication with the agent. We also find that our agent system improves the user's impression of the agent in terms of its engagement awareness, behavior appropriateness, conversation smoothness, favorability, and intelligence.

Read full abstract

Multimodal Dialogue Research Articles

Related Topics

Articles published on Multimodal Dialogue

A Multimodal Dialog System for Language Assessment: Current State and Future Directions

Using Vision and Speech Features for Automated Prediction of Performance Metrics in Multimodal Dialogs

Constructionist Design Methodology for Interactive Intelligences

Multimodal control system for autonomous vehicles using speech and gesture recognition

Artificial cognition for social human–robot interaction: An implementation

Multimodal Dialogues: Making Visual Meaning through the Translation of Tacit Knowledge

The roles and recognition of Haptic-Ostensive actions in collaborative multimodal human–human dialogues

A Natural Conversational Virtual Human with Multimodal Dialog System

Fluent Human–Robot Dialogues About Grounded Objects in Home Environments

SIMONA – the Slovak embodied conversational agent

MULTIMODAL FUSION AS COMMUNICATIVE ACTS DURING HUMAN–ROBOT INTERACTION

Gaze awareness in conversational agents

A graphical editor for the SMUIML multimodal user interaction description language

Multi-modal Dialogue Analysis on Turn-taking and Difference of Variations in Brainwave between Cooperative Learners

User Localization During Human-Robot Interaction

Building Autonomous Sensitive Artificial Listeners

End-user programming of a social robot by dialog

Using speech to identify gesture pen strokes in collaborative, multimodal device descriptions

Classifying dialogue in high-dimensional space

Learning and Evaluation of Dialogue Strategies for New Applications: Empirical Methods for Optimization from Small Data Sets

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multimodal Dialogue Research Articles

Related Topics

Articles published on Multimodal Dialogue

A Multimodal Dialog System for Language Assessment: Current State and Future Directions

Using Vision and Speech Features for Automated Prediction of Performance Metrics in Multimodal Dialogs

Constructionist Design Methodology for Interactive Intelligences

Multimodal control system for autonomous vehicles using speech and gesture recognition

Artificial cognition for social human–robot interaction: An implementation

Multimodal Dialogues: Making Visual Meaning through the Translation of Tacit Knowledge

The roles and recognition of Haptic-Ostensive actions in collaborative multimodal human–human dialogues

A Natural Conversational Virtual Human with Multimodal Dialog System

Fluent Human–Robot Dialogues About Grounded Objects in Home Environments

SIMONA – the Slovak embodied conversational agent

MULTIMODAL FUSION AS COMMUNICATIVE ACTS DURING HUMAN–ROBOT INTERACTION

Gaze awareness in conversational agents

A graphical editor for the SMUIML multimodal user interaction description language

Multi-modal Dialogue Analysis on Turn-taking and Difference of Variations in Brainwave between Cooperative Learners

User Localization During Human-Robot Interaction

Building Autonomous Sensitive Artificial Listeners

End-user programming of a social robot by dialog

Using speech to identify gesture pen strokes in collaborative, multimodal device descriptions

Classifying dialogue in high-dimensional space

Learning and Evaluation of Dialogue Strategies for New Applications: Empirical Methods for Optimization from Small Data Sets