Abstract

Autonomously exploring and mapping is one of the open challenges of robotics and artificial intelligence. Especially when the environment is unknown, choosing the optimal navigation directive is not straightforward. In this paper, we propose a reinforcement learning framework for navigating, exploring, and mapping unknown environments. The reinforcement learning agent is in charge of selecting the commands for steering the mobile robot, while a SLAM algorithm estimates the robot pose and maps the environment. To select optimal actions, the agent is trained to be curious about the world. This concept translates into the introduction of a curiosity-driven reward function that encourages the agent to steer the mobile robot towards unknown and unseen areas of the world and of the map. We test our approach on exploration challenges in different indoor environments. The agent trained with the proposed reward function outperforms agents trained with reward functions commonly used in the literature for solving such tasks.

Highlights

  • The problem of autonomous robot navigation is traditionally tackled by employing environment representations, i.e. maps, that are used to plan a collision-free path to reach specific target locations

  • The reinforcement learning algorithm, i.e. DDPG, relies only on 80 2D-LiDAR readings, the robot’s pose estimate coming from the Rao-Blackwellized particle filter (RBPF) Simultaneous Localization and Mapping (SLAM) algorithm, the previous action taken by the agent, the percentage of the map to be explored, and the time steps left before the end of the episode (see the observation sketch after this list)

  • We propose an adaptation of the episodic curiosity reward introduced by Savinov et al. (2018) for improving the exploration skills of the reinforcement learning algorithm in the context of active SLAM, and we investigate its effect on the generalization skills of the trained agent to different environment topologies (a simplified reward sketch follows this list)
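
As a concrete illustration of the observation described in the second highlight, the sketch below assembles those inputs into a single vector. The function name, the maximum LiDAR range, and the normalization choices are illustrative assumptions, not the paper's exact preprocessing.

```python
import numpy as np

def build_observation(lidar_ranges, pose_estimate, prev_action,
                      explored_percentage, steps_left, max_steps):
    """Assemble the DDPG observation described in the highlights.

    All names, the 10 m maximum range, and the normalizations are
    illustrative assumptions, not the paper's exact preprocessing.
    """
    assert len(lidar_ranges) == 80                       # 80 2D-LiDAR readings
    lidar = np.asarray(lidar_ranges, np.float32) / 10.0  # assumed 10 m max range
    pose = np.asarray(pose_estimate, np.float32)         # (x, y, yaw) from RBPF-SLAM
    action = np.asarray(prev_action, np.float32)         # previous (v, omega) command
    remaining = np.array([explored_percentage / 100.0], np.float32)  # map share still to explore
    time_left = np.array([steps_left / max_steps], np.float32)       # normalized step budget
    return np.concatenate([lidar, pose, action, remaining, time_left])
```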
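The episodic curiosity bonus in the third highlight can be pictured with a simplified stand-in. Savinov et al. (2018) score novelty with a learned reachability network over observation embeddings stored in an episodic memory; the sketch below swaps that network for Euclidean distance between SLAM pose estimates, so the class name, threshold, and bonus value are assumptions for illustration only.

```python
import numpy as np

class EpisodicCuriosityReward:
    """Simplified episodic-curiosity bonus in the spirit of Savinov et al. (2018).

    The original method trains a reachability network over observation
    embeddings; as an illustrative stand-in, novelty is measured here by
    Euclidean distance between SLAM pose estimates kept in an episodic
    memory. The threshold and bonus values are assumptions.
    """

    def __init__(self, novelty_threshold=1.0, bonus=1.0):
        self.memory = []                             # poses visited this episode
        self.novelty_threshold = novelty_threshold   # metres
        self.bonus = bonus

    def reset(self):
        self.memory.clear()                          # call at episode start

    def __call__(self, pose):
        position = np.asarray(pose[:2], np.float32)  # (x, y) position only
        if not self.memory:
            self.memory.append(position)
            return self.bonus
        nearest = min(np.linalg.norm(position - m) for m in self.memory)
        if nearest > self.novelty_threshold:         # far from everything seen so far
            self.memory.append(position)             # remember the new place
            return self.bonus                        # reward reaching novel areas
        return 0.0                                   # familiar region, no bonus
```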


Summary

Introduction

The problem of autonomous robot navigation is traditionally tackled by employing environment representations, i.e. maps, that are used to plan a collision-free path to reach specific target locations. These indoor maps are usually constructed using Simultaneous Localization and Mapping, or SLAM, algorithms (Thrun et al, 2005). Successful reinforcement learning-based solutions are proposed in (Wu et al, 2018), (Tai et al, 2017), (Zhelo et al, 2018), (Pfeiffer et al, 2017), (Zhang et al, 2018), and (Zhang et al, 2020). Such map-less path planners, however, often require long training times and a large amount of data to perform well. The actor network π(s; θπ) is updated with the deterministic policy gradient theorem, shown in Equation (1):

∇_θπ J ≈ E_s[ ∇_a Q(s, a; θQ) |_{a=π(s; θπ)} · ∇_θπ π(s; θπ) ]        (1)
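
A minimal PyTorch sketch of one such actor step is given below, assuming `actor` and `critic` are standard torch.nn modules; it illustrates the textbook DDPG update of Equation (1), not necessarily the paper's exact training loop.

```python
import torch

def ddpg_actor_update(actor, critic, actor_optimizer, states):
    """One actor step via the deterministic policy gradient of Equation (1).

    `actor` maps states to actions and `critic` maps (state, action) pairs
    to Q-values; both are assumed to be torch.nn.Module instances. This is
    the standard DDPG update, not the paper's exact implementation.
    """
    actions = actor(states)                       # a = π(s; θπ)
    actor_loss = -critic(states, actions).mean()  # ascend Q by descending -Q
    actor_optimizer.zero_grad()
    actor_loss.backward()   # autograd forms ∇_a Q · ∇_θπ π via the chain rule
    actor_optimizer.step()
    return actor_loss.item()
```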

