A learning search algorithm with propagational reinforcement learning

Wei Zhang

doi:10.1007/s10489-020-02117-0

Abstract

When reinforcement learning with a deep neural network is applied to heuristic search, the search becomes a learning search. In a learning search system, there are two key components: (1) a deep neural network with sufficient expression ability as a heuristic function approximator that estimates the distance from any state to a goal; (2) a strategy to guide the interaction of an agent with its environment to obtain more efficient simulated experience to update the Q-value or V-value function of reinforcement learning. To date, neither component has been sufficiently discussed. This study theoretically discusses the size of a deep neural network for approximating a product function of p piecewise multivariate linear functions. The existence of such a deep neural network with O(n + p) layers and O(dn + dnp + dp) neurons has been proven, where d is the number of variables of the multivariate function being approximated, 𝜖 is the approximation error, and n = O(p + log2(pd/𝜖)). For the second component, this study proposes a general propagational reinforcement-learning-based learning search method that improves the estimate h(.) according to the newly observed distance information about the goals, propagates the improvement bidirectionally in the search tree, and consequently obtains a sequence of more accurate V-values for a sequence of states. Experiments on the maze problems show that our method increases the convergence rate of reinforcement learning by a factor of 2.06 and reduces the number of learning episodes to 1/4 that of other nonpropagating methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A learning search algorithm with propagational reinforcement learning

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence

Lead the way for us

Similar Papers

Artificial Intelligence and the Common Sense of Animals.
Murray Shanahan ... Benjamin Beyret
Trends in Cognitive Sciences | VOL. 24
Murray Shanahan, et. al.Murray Shanahan ... Benjamin Beyret
08 Oct 2020
Trends in Cognitive Sciences | VOL. 24

Break through the limits of learning by machines
Zhongzhi Shi
Chinese Science Bulletin | VOL. 61
Zhongzhi ShiZhongzhi Shi
20 Sep 2016
Chinese Science Bulletin | VOL. 61

Low dimensional approximation and generalization of multivariate functions on smooth manifolds using deep ReLU neural networks
Demetrio Labate ... Ji Shi
Neural networks : the official journal of the International Neural Network Society | VOL. 174
Demetrio Labate, et. al.Demetrio Labate ... Ji Shi
01 Mar 2024
Neural networks : the official journal of the International Neural Network Society | VOL. 174

Scheduling Large-scale Distributed Training via Reinforcement Learning
Zhanglin Peng ... Ping Luo
-
Zhanglin Peng, et. al.Zhanglin Peng ... Ping Luo
01 Dec 2018
01 Dec 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A learning search algorithm with propagational reinforcement learning

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence