Physics-Driven Machine Learning for Time-Optimal Path Planning in Stochastic Dynamic Flows

Rohit Chowdhury,Deepak N Subramani

doi:10.1007/978-3-030-61725-7_34

Abstract

AbstractOptimal path planning of autonomous marine agents is important to minimize operational costs of ocean observation systems. Within the context of DDDAS, we present a Reinforcement Learning (RL) framework for computing a dynamically adaptable policy that minimizes expected travel time of autonomous vehicles between two points in stochastic dynamic flows. To forecast the stochastic dynamic environment, we utilize the reduced order data-driven dynamically orthogonal (DO) equations. For planning, a novel physics-driven online Q-learning is developed. First, the distribution of exact time optimal paths predicted by stochastic DO Hamilton-Jacobi level set partial differential equations are utilized to initialize the action value function (Q-value) in a transfer learning approach. Next, the flow data collected by onboard sensors are utilized in a feedback loop to adaptively refine the optimal policy. For the adaptation, a simple Bayesian estimate of the environment is performed (the DDDAS data assimilation loop) and the inferred environment is used to update the Q-values in an \(\epsilon -\)greedy exploration approach (the RL step). To validate our Q-learning solution, we compare it with a fully offline, dynamic programming solution of the Markov Decision Problem corresponding to the RL framework. For this, novel numerical schemes to efficiently utilize the DO forecasts are derived and computationally efficient GPU-implementation is completed. We showcase the new RL algorithm and elucidate its computational advantages by planning paths in a stochastic quasi-geostrophic double gyre circulation.KeywordsPath planningQ-learningMarkov Decision ProcessDynamically orthogonal equationsTransfer Learning

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Physics-Driven Machine Learning for Time-Optimal Path Planning in Stochastic Dynamic Flows

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

OpenGraphGym: A Parallel Reinforcement Learning Framework for Graph Optimization Problems
Weijian Zheng ... Fengguang Song
-
Weijian Zheng, et. al.Weijian Zheng ... Fengguang Song
01 Jan 2020
01 Jan 2020

Reinforcement learning with algorithms from probabilistic structure estimation
Jonathan P Epperlein ... Robert Shorten
Automatica | VOL. 144
Jonathan P Epperlein, et. al.Jonathan P Epperlein ... Robert Shorten
06 Aug 2022
Automatica | VOL. 144

A Neural Network Based Automatic Generation Controller Design through Reinforcement Learning
...
International Journal of Emerging Electric Power Systems | VOL. 6
, et. al. ...
20 May 2006
International Journal of Emerging Electric Power Systems | VOL. 6

Autonomy for surface ship interception
C Mirabito ... J Edwards
-
C Mirabito, et. al.C Mirabito ... J Edwards
01 Jun 2017
01 Jun 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Physics-Driven Machine Learning for Time-Optimal Path Planning in Stochastic Dynamic Flows

Abstract

Talk to us

Similar Papers