Abstract

Vision-and-language navigation is a newly emerging research topic that has developed rapidly in recent years, and it is one of the representative tasks at the frontier of vision-language interaction. The goal of the task is to achieve autonomous navigation based on visual perception of the environment, following language instructions given by humans. This paper reviews recent progress in vision-and-language navigation. First, the research content of the task is introduced, and its three main problems and challenges are analyzed: cross-modal semantic alignment, semantic understanding and reasoning, and improving generalization ability. Second, commonly used datasets and evaluation metrics are listed. Third, research progress on the task is summarized from four aspects: imitation learning, reinforcement learning, self-supervised learning, and other methods; the performance of typical solutions is carefully compared and analyzed. Fourth, current research trends of the task are discussed, mainly including navigation in continuous environments, comprehension of advanced complex instructions, and commonsense reasoning. Finally, future development directions such as 3D vision-and-language navigation, embodied question answering, and interactive question answering are further discussed and prospected.
