Abstract

Humans use verbal and non-verbal cues to communicate their intent in collaborative tasks. In situated dialogue, speakers typically direct their interlocutor's attention to referent objects using multimodal cues, and references to such entities are resolved collaboratively. In this study, we designed a multiparty task in which humans teach each other how to assemble furniture, and we captured eye gaze, speech, and pointing gestures. We analysed which multimodal cues carry the most information for resolving referring expressions, and we report an object saliency classifier that uses multisensory input from both speaker and addressee to detect the referent objects during the collaborative task.
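The abstract describes a classifier that fuses speaker and addressee cues (gaze, pointing, speech) to score how likely each candidate object is to be the referent. As a minimal sketch of that idea, the snippet below trains a generic classifier on hand-picked illustrative features; the feature names, the toy data, and the random-forest choice are assumptions for illustration, not the feature set or model reported in the paper.

```python
# Minimal sketch of an object-saliency classifier over multimodal cues.
# Features and model are illustrative assumptions, not the paper's actual setup.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Each row describes one candidate object in one time window:
# [speaker_gaze_ratio, addressee_gaze_ratio, pointing_distance_cm, mentioned_in_speech]
X_train = np.array([
    [0.70, 0.55, 12.0, 1],   # fixated by both and pointed at -> referent
    [0.05, 0.10, 85.0, 0],   # barely attended -> non-referent
    [0.40, 0.60, 20.0, 1],
    [0.10, 0.05, 60.0, 0],
])
y_train = np.array([1, 0, 1, 0])  # 1 = referent object, 0 = non-referent

clf = RandomForestClassifier(n_estimators=100, random_state=0)
clf.fit(X_train, y_train)

# Score a new candidate object from the combined speaker/addressee cues.
candidate = np.array([[0.65, 0.50, 15.0, 1]])
print(clf.predict_proba(candidate))  # estimated probability the object is the referent
```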
