Multimodal estimation and communication of latent semantic knowledge for robust execution of robot instructions

Jacob Arkin,Nicholas Roy,Daehyung Park,Matthew R Walter,Thomas M Howard,Subhro Roy,Rohan Paul

doi:10.1177/0278364920917755

Abstract

The goal of this article is to enable robots to perform robust task execution following human instructions in partially observable environments. A robot’s ability to interpret and execute commands is fundamentally tied to its semantic world knowledge. Commonly, robots use exteroceptive sensors, such as cameras or LiDAR, to detect entities in the workspace and infer their visual properties and spatial relationships. However, semantic world properties are often visually imperceptible. We posit the use of non-exteroceptive modalities including physical proprioception, factual descriptions, and domain knowledge as mechanisms for inferring semantic properties of objects. We introduce a probabilistic model that fuses linguistic knowledge with visual and haptic observations into a cumulative belief over latent world attributes to infer the meaning of instructions and execute the instructed tasks in a manner robust to erroneous, noisy, or contradictory evidence. In addition, we provide a method that allows the robot to communicate knowledge dissonance back to the human as a means of correcting errors in the operator’s world model. Finally, we propose an efficient framework that anticipates possible linguistic interactions and infers the associated groundings for the current world state, thereby bootstrapping both language understanding and generation. We present experiments on manipulators for tasks that require inference over partially observed semantic properties, and evaluate our framework’s ability to exploit expressed information and knowledge bases to facilitate convergence, and generate statements to correct declared facts that were observed to be inconsistent with the robot’s estimate of object properties.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The International Journal of Robotics Research	Publication Date: Jun 5, 2020
Citations: 27	License type: cc-by-nc-sa

R Discovery Prime

R Discovery Prime

Multimodal estimation and communication of latent semantic knowledge for robust execution of robot instructions

Abstract

Talk to us

Similar Papers

More From: The International Journal of Robotics Research

Lead the way for us

Similar Papers

The Role of Visual and Semantic Properties in the Emergence of Category-Specific Patterns of Neural Response in the Human Brain.
David D Coggan ... Daniel H Baker
eneuro | VOL. 3
David D Coggan, et. al.David D Coggan ... Daniel H Baker
01 Jul 2016
eneuro | VOL. 3

A data driven approach to understanding the organization of high-level visual cortex
David M Watson ... Timothy J Andrews
Scientific Reports | VOL. 7
David M Watson, et. al.David M Watson ... Timothy J Andrews
15 Jun 2017
Scientific Reports | VOL. 7

Recurrent connectivity supports higher-level visual and semantic object representations in the brain
Jacqueline Von Seth ... Alex Clarke
Communications Biology | VOL. 6
Jacqueline Von Seth, et. al.Jacqueline Von Seth ... Alex Clarke
27 Nov 2023
Communications Biology | VOL. 6

Action and semantic tool knowledge - Effective connectivity in the underlying neural networks.
Nina N Kleineberg ... Peter H Weiss
Human brain mapping | VOL. 39
Nina N Kleineberg, et. al.Nina N Kleineberg ... Peter H Weiss
26 Apr 2018
Human brain mapping | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multimodal estimation and communication of latent semantic knowledge for robust execution of robot instructions

Abstract

Talk to us

Similar Papers

More From: The International Journal of Robotics Research