Gated-Attention Architectures for Task-Oriented Language Grounding

Devendra Singh Chaplot,Ruslan Salakhutdinov,Dheeraj Rajagopal,Kanthashree Mysore Sathyendra,Rama Kumar Pasumarthi

doi:10.1609/aaai.v32i1.11832

Abstract

To perform tasks specified by natural language instructions, autonomous agents need to extract semantically meaningful representations of language and map it to visual elements and actions in the environment. This problem is called task-oriented language grounding. We propose an end-to-end trainable neural architecture for task-oriented language grounding in 3D environments which assumes no prior linguistic or perceptual knowledge and requires only raw pixels from the environment and the natural language instruction as input. The proposed model combines the image and text representations using a Gated-Attention mechanism and learns a policy to execute the natural language instruction using standard reinforcement and imitation learning methods. We show the effectiveness of the proposed model on unseen instructions as well as unseen maps, both quantitatively and qualitatively. We also introduce a novel environment based on a 3D game engine to simulate the challenges of task-oriented language grounding over a rich set of instructions and environment states.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Gated-Attention Architectures for Task-Oriented Language Grounding

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Apr 29, 2018
Citations: 57

Similar Papers

Attention Based Natural Language Grounding by Navigating Virtual Environment
Abhishek Sinha ... Mausoom Sarkar
-
Abhishek Sinha, et. al.Abhishek Sinha ... Mausoom Sarkar
01 Jan 2019
01 Jan 2019

Non-instructional linguistic communication with virtual actors
M Cavazza ... S.J Mead
-
M Cavazza, et. al.M Cavazza ... S.J Mead
01 Jan 2001
01 Jan 2001

Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Shizhe Chen ... Cordelia Schmid
-
Shizhe Chen, et. al.Shizhe Chen ... Cordelia Schmid
01 Jan 2021
01 Jan 2021

Learning and Executing Re-Usable Behaviour Trees From Natural Language Instruction
Gavin Suddrey ... Ben Talbot
IEEE Robotics and Automation Letters | VOL. 7
Gavin Suddrey, et. al.Gavin Suddrey ... Ben Talbot
01 Oct 2022
IEEE Robotics and Automation Letters | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Gated-Attention Architectures for Task-Oriented Language Grounding

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence