Representation Learning for Grounded Spatial Reasoning

Michael Janner,Karthik Narasimhan,Regina Barzilay

doi:10.1162/tacl_a_00004

Michael Janner, Karthik Narasimhan + Show 1 more

Open Access

https://doi.org/10.1162/tacl_a_00004

Copy DOI

Abstract

The interpretation of spatial references is highly contextual, requiring joint inference over both language and the environment. We consider the task of spatial reasoning in a simulated environment, where an agent can act and receive rewards. The proposed model learns a representation of the world steered by instruction text. This design allows for precise alignment of local neighborhoods with corresponding verbalizations, while also handling global references in the instructions. We train our model with reinforcement learning using a variant of generalized value iteration. The model outperforms state-of-the-art approaches on several metrics, yielding a 45% reduction in goal localization error.

Highlights

Understanding spatial references in natural language is essential for successful human-robot communication and autonomous navigation
We assume access to a simulated environment, in which an agent can take actions to interact with the world and is rewarded for reaching the location specified by the language instruction
Task setup We model our task as a Markov Decision Process (MDP), where an autonomous agent is placed in an interactive environment with the capability to choose actions that can affect the world

Summary

Introduction

Understanding spatial references in natural language is essential for successful human-robot communication and autonomous navigation. This problem is challenging because interpretation of spatial references is highly context-dependent. We explore the problem of spatial reasoning in the context of interactive worlds. We assume access to a simulated environment, in which an agent can take actions to interact with the world and is rewarded for reaching the location specified by the language instruction. This feedback is the only source of supervision the model uses for interpreting spatial references

Objectives

Methods

Results

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Transactions of the Association for Computational Linguistics	Publication Date: Dec 1, 2018
Citations: 72	License type: cc-by

R Discovery Prime

R Discovery Prime

Representation Learning for Grounded Spatial Reasoning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics

Lead the way for us

Similar Papers

Reinforcement Learning for Clinical Applications.
Kia Khezeli ... Benjamin Shickel
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18
Kia Khezeli, et. al.Kia Khezeli ... Benjamin Shickel
08 Feb 2023
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18

Orbit correction based on improved reinforcement learning algorithm
Xiaolong Chen ... Zhijun Wang
Physical Review Accelerators and Beams | VOL. 26
Xiaolong Chen, et. al.Xiaolong Chen ... Zhijun Wang
13 Apr 2023
Physical Review Accelerators and Beams | VOL. 26

Off-Policy Reinforcement Learning for Robotics

-

30 Mar 2021
30 Mar 2021

Bootstrapping Human-Autonomy Collaborations by using Brain-Computer Interface of SSVEP for Multi-Agent Deep Reinforcement Learning
Joshua Ho ... Chun-Hsiang Chuang
-
Joshua Ho, et. al.Joshua Ho ... Chun-Hsiang Chuang
17 Nov 2022
17 Nov 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Representation Learning for Grounded Spatial Reasoning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics