Abstract
This letter addresses two challenges facing sampling-based kinodynamic motion planning: identifying good candidate states for local transitions, and the computationally intractable steering between those candidate states. By combining sampling-based planning, a Rapidly-exploring Random Tree (RRT), and a machine-learned kinodynamic local planner, we propose an efficient solution to long-range kinodynamic motion planning. First, we use deep reinforcement learning to learn an obstacle-avoiding policy that maps a robot's sensor observations to actions; this policy serves as a local planner during planning and as a controller during execution. Second, we train a reachability estimator in a supervised manner to predict the RL policy's time to reach a state in the presence of obstacles. Lastly, we introduce RL-RRT, which uses the RL policy as a local planner and the reachability estimator as the distance function to bias tree growth toward promising regions. We evaluate our method on three kinodynamic systems, including physical robot experiments. Results across all three robots indicate that RL-RRT outperforms state-of-the-art kinodynamic planners in efficiency and also yields a shorter path finish time than a steering-function-free method. The learned local-planner policy and the accompanying reachability estimator transfer to previously unseen experimental environments, and RL-RRT is fast because the expensive steering computations are replaced with simple neural network inference.
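To make the tree-extension loop described above concrete, here is a minimal, hypothetical Python sketch for a 2-D point robot. The names rl_policy, reachability_time, and step_dynamics are stand-ins of our own (in the letter these are a learned obstacle-avoiding policy, a learned time-to-reach estimator, and the robot's true kinodynamics), and obstacle checking is omitted for brevity.

```python
# Hypothetical sketch of the RL-RRT extension loop; not the letter's code.
import math
import random

def rl_policy(state, goal):
    """Stand-in local planner: heading that steers straight at the goal."""
    return math.atan2(goal[1] - state[1], goal[0] - state[0])

def reachability_time(state, candidate):
    """Stand-in reachability estimator: Euclidean proxy for time-to-reach."""
    return math.dist(state, candidate)

def step_dynamics(state, heading, dt=0.1, speed=1.0):
    """Stand-in forward model: constant-speed unicycle step."""
    return (state[0] + speed * math.cos(heading) * dt,
            state[1] + speed * math.sin(heading) * dt)

def rl_rrt(start, goal, iters=2000, goal_bias=0.1, horizon=25, tol=0.3):
    tree = {start: None}  # node -> parent, for path reconstruction
    for _ in range(iters):
        # Goal-biased sampling of a candidate state.
        sample = goal if random.random() < goal_bias else \
            (random.uniform(0.0, 10.0), random.uniform(0.0, 10.0))
        # Reachability estimator as the distance function: expand the node
        # the policy is predicted to reach the sample from fastest.
        nearest = min(tree, key=lambda n: reachability_time(n, sample))
        # RL policy as the local planner: roll it out toward the sample under
        # the dynamics instead of solving a two-point steering problem.
        state = nearest
        for _ in range(horizon):
            state = step_dynamics(state, rl_policy(state, sample))
        tree[state] = nearest
        if math.dist(state, goal) <= tol:
            path = [state]
            while tree[path[-1]] is not None:
                path.append(tree[path[-1]])
            return path[::-1]
    return None

if __name__ == "__main__":
    print(rl_rrt(start=(0.0, 0.0), goal=(9.0, 9.0)))
```

The key design choice the letter describes is visible in the two marked steps: nearest-neighbor selection uses predicted time-to-reach rather than a geometric metric, and extension rolls out the policy rather than calling a steering function.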
Highlights
Consider motion planning for robots such as UAVs [16], autonomous ships [3], and spacecraft [22]
To address the challenges facing kinodynamic motion planning, namely the lack of available steering functions, of good distance functions for guiding tree growth, and of obstacle awareness, we propose RL-RRT, which combines reinforcement learning (RL) with a sampling-based planner, the Rapidly-exploring Random Tree (RRT)
To train a policy robust to noise, we model the RL policy as the solution to a continuous-state, continuous-action, partially observable Markov decision process (POMDP) given as a tuple (Ω, S, A, D, R, γ, O) of observations, states, actions, dynamics, reward, a scalar discount γ ∈ (0, 1), and observation probabilities, formalized below
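For concreteness, the standard discounted-return objective for such a POMDP, with a memoryless policy π acting on the current observation as is common in deep RL, can be written as follows (a generic formalization under the tuple above, not quoted from the letter):

```latex
% Generic POMDP objective for the tuple (\Omega, S, A, D, R, \gamma, O);
% the letter's specific reward shaping is not reproduced here.
\pi^{*} = \arg\max_{\pi}\;
\mathbb{E}\Big[\textstyle\sum_{t=0}^{\infty} \gamma^{t}\, R(s_t, a_t)\Big],
\qquad a_t \sim \pi(\cdot \mid o_t),\;
o_t \sim O(\cdot \mid s_t),\;
s_{t+1} \sim D(\cdot \mid s_t, a_t).
```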
Summary
Consider motion planning for robots such as UAVs [16], autonomous ships [3], and spacecraft [22].