SayNav: Grounding Large Language Models for Dynamic Planning to Navigation in New Environments

Abhinav Rajvanshi, Alvaro Velasquez, Bhoram Lee, Han-Pang Chiu, Xiao Lin, Karan Sikka

doi:10.1609/icaps.v34i1.31506

Abstract

Semantic reasoning and dynamic planning capabilities are crucial for an autonomous agent to perform complex navigation tasks in unknown environments. Succeeding in these tasks requires a large amount of the common-sense knowledge that humans possess. We present SayNav, a new approach that leverages human knowledge from Large Language Models (LLMs) for efficient generalization to complex navigation tasks in unknown large-scale environments. SayNav uses a novel grounding mechanism, which incrementally builds a 3D scene graph of the explored environment as input to LLMs, to generate feasible and contextually appropriate high-level plans for navigation. The LLM-generated plan is then executed by a pre-trained low-level planner that treats each planned step as a short-distance point-goal navigation sub-task. SayNav dynamically generates step-by-step instructions during navigation and continuously refines future steps based on newly perceived information. We evaluate SayNav on the multi-object navigation (MultiON) task, which requires the agent to draw on a massive amount of human knowledge to efficiently search for multiple different objects in an unknown environment. We also introduce a benchmark dataset for the MultiON task, built with the ProcTHOR framework, which provides large photo-realistic indoor environments with a variety of objects. SayNav achieves state-of-the-art results and even outperforms an oracle-based baseline with strong ground-truth assumptions by more than 8% in terms of success rate, highlighting its ability to generate dynamic plans for successfully locating objects in large-scale new environments. The code, benchmark dataset and demonstration videos are accessible at https://www.sri.com/ics/computer-vision/saynav.
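
The loop described in the abstract (perceive, extend the scene graph, ask a high-level planner for steps, hand the next step to a point-goal navigator, then replan) can be illustrated with a small sketch. This is not the authors' code: `SceneGraph`, `navigate`, `stub_planner`, and the toy `WORLD` are hypothetical names invented here, the graph is a flat 2D map rather than a 3D scene graph, and the LLM call is replaced by a deterministic stub so the example runs on its own.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Set, Tuple

Point = Tuple[float, float]


@dataclass
class SceneGraph:
    """Hypothetical stand-in for the incrementally built scene graph."""
    objects: Dict[str, Point] = field(default_factory=dict)  # object name -> position
    visited: Set[Point] = field(default_factory=set)         # places already explored

    def to_prompt(self) -> str:
        """Serialize the graph as text, the form an LLM planner would receive."""
        return "; ".join(f"{n} at {p}" for n, p in sorted(self.objects.items()))


Planner = Callable[[SceneGraph, Set[str]], List[Point]]


def navigate(targets: Set[str], start: Point,
             perceive: Callable[[Point], Dict[str, Point]],
             plan_steps: Planner,
             point_goal_nav: Callable[[Point, Point], Point],
             max_rounds: int = 20) -> Tuple[Set[str], List[Point]]:
    """Alternate perception, high-level planning, and short point-goal moves."""
    graph = SceneGraph()
    found: Set[str] = set()
    position, path = start, [start]
    for _ in range(max_rounds):
        graph.visited.add(position)
        graph.objects.update(perceive(position))       # grow the graph with new observations
        found = {t for t in targets if t in graph.objects}
        if found == targets:
            break
        steps = plan_steps(graph, targets - found)     # replan from the updated graph
        if not steps:
            break                                      # nothing left to explore
        position = point_goal_nav(position, steps[0])  # execute only the next step
        path.append(position)
    return found, path


# Toy world (assumption): what the agent observes from each position.
WORLD: Dict[Point, Dict[str, Point]] = {
    (0.0, 0.0): {"door": (1.0, 0.0)},
    (1.0, 0.0): {"sofa": (1.0, 1.0), "laptop": (1.2, 1.0)},
    (1.0, 1.0): {"fridge": (2.0, 1.0), "mug": (2.0, 1.1)},
}


def stub_planner(graph: SceneGraph, remaining: Set[str]) -> List[Point]:
    """Stand-in for the LLM: head for every known but unvisited landmark.

    A real planner would send graph.to_prompt() and `remaining` to an LLM.
    """
    return [p for _, p in sorted(graph.objects.items()) if p not in graph.visited]


found, path = navigate({"laptop", "mug"}, (0.0, 0.0),
                       perceive=lambda pos: WORLD.get(pos, {}),
                       plan_steps=stub_planner,
                       point_goal_nav=lambda cur, goal: goal)  # idealized: always arrives
print(sorted(found), path)
```

Executing only the first planned step before replanning mirrors the abstract's point that future steps are refined as new information is perceived; a fuller implementation might execute several steps between planner calls to reduce LLM queries.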

Journal: Proceedings of the International Conference on Automated Planning and Scheduling
Publication Date: May 30, 2024
Citations: 1

Similar Papers

UUV dynamic path planning and trap escape strategies in unknown environment
Xuelian Zhang ... Qing Li
01 Jul 2016

Evaluation of RGB-D SLAM in Large Indoor Environments
Kirill Muravyev ... Konstantin Yakovlev
01 Jan 2021

Coverage Rolling Path Planning of Unknown Environments with Dynamic Heuristic Searching
Xiaoqin Guo
01 Jan 2009

Crowd-sensing Simultaneous Localization and Radio Fingerprint Mapping Based on Probabilistic Similarity Models
Ran Liu ... Chau Yuen
01 May 2019
