Reflex-based open-vocabulary navigation without prior knowledge using omnidirectional camera and multiple vision-language models

Kento Kawaharazuka,Yoshiki Obinata,Naoaki Kanazawa,Naoto Tsukamoto,Kei Okada,Masayuki Inaba

doi:10.1080/01691864.2024.2393409

Abstract

Various robot navigation methods have been developed, but they are mainly based on Simultaneous Localization and Mapping (SLAM), reinforcement learning, etc., which require prior map construction or learning. In this study, we consider the simplest method that does not require any map construction or learning, and execute open-vocabulary navigation of robots without any prior knowledge to do this. We applied an omnidirectional camera and pre-trained vision-language models to the robot. The omnidirectional camera provides a uniform view of the surroundings, thus eliminating the need for complicated exploratory behaviors including trajectory generation. By applying multiple pre-trained vision-language models to this omnidirectional image and incorporating reflective behaviors, we show that navigation becomes simple and does not require any prior setup. Interesting properties and limitations of our method are discussed based on experiments with the mobile robot Fetch.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Reflex-based open-vocabulary navigation without prior knowledge using omnidirectional camera and multiple vision-language models

Abstract

Talk to us

Similar Papers

More From: Advanced Robotics

Lead the way for us

Similar Papers

Building consistent local submaps with omnidirectional SLAM
Cyril Joly ... Patrick Rives
-
Cyril Joly, et. al.Cyril Joly ... Patrick Rives
01 Sep 2009
01 Sep 2009

Accurate and Robust Monocular SLAM with Omnidirectional Cameras.
Shuoyuan Liu ... Peng Guo
Sensors | VOL. 19
Shuoyuan Liu, et. al.Shuoyuan Liu ... Peng Guo
16 Oct 2019
Sensors | VOL. 19

Adapting a real-time monocular visual SLAM from conventional to omnidirectional cameras
Daniel Gutierrez ... Alejandro Rituerto
-
Daniel Gutierrez, et. al.Daniel Gutierrez ... Alejandro Rituerto
01 Nov 2011
01 Nov 2011

Visual SLAM based on EKF filtering algorithm from omnidirectional camera
Chen Hui ... Ma Shiwei
-
Chen Hui, et. al.Chen Hui ... Ma Shiwei
01 Aug 2013
01 Aug 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reflex-based open-vocabulary navigation without prior knowledge using omnidirectional camera and multiple vision-language models

Abstract

Talk to us

Similar Papers

More From: Advanced Robotics