Abstract
Designing an algorithm that builds a program to solve all forms of problems is an attractive idea. Programmers would not need to spend effort figuring out an optimal way to solve each problem: the algorithm itself would explore the problem and automatically find a solution. This promise of automatic programming is what makes reinforcement learning so appealing, with the potential to apply to virtually every task in our world. Reinforcement learning is a general framework for finding an optimal solution to a given task. Its generality reduces the programmer's effort in mapping a specific task into a reinforcement learning problem, but this same feature is also the main performance bottleneck of reinforcement learning. Since the framework imposes few constraints, the search space of the policy grows exponentially with the dimension of the state space. When a high-dimensional problem is mapped into a reinforcement learning problem, conventional reinforcement learning algorithms become infeasible. Most real-world problems are high-dimensional, and this is the major limitation of reinforcement learning. Therefore, finding a way to reduce the search space and improve search efficiency is the most important challenge. When dealing with a high-dimensional problem, there are two common approaches to improving performance. One is to reduce the dimensionality; the other is to find a better optimization algorithm. The first approach has drawn much attention in recent years, and many interesting works along this line have been proposed to boost the performance of reinforcement learning. This article reviews some recent advances in the dimensionality reduction approach. Approaches for dimensionality reduction can be classified into two categories: value function approximation and abstraction.
The first category approximates the utility function with a specific functional form, usually a linear function, since optimizing the value function becomes much more efficient in the linear case. This approximation not only reduces the cost of optimization, but also provides the ability to handle continuous-variable problems and to generalize the learned policy to similar problems. The second category identifies the structure of the problem, represents the problem with higher-level concepts, and removes irrelevant parameters. Approaches in this category identify sub-spaces of the state space that need not involve other sub-spaces when solving the problem. For example, in solving a “make-coffee” problem, we can divide the problem into two sub-problems: “reach the coffee maker” and “cook coffee with
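To make the first category concrete, below is a minimal sketch of linear value function approximation trained with a TD(0) update. The feature vectors, step size, and discount factor are illustrative assumptions, not details from the article; the point is only that the value function V(s) is represented as a dot product w·φ(s), so learning reduces to updating a small weight vector rather than a table over the full state space.

```python
import numpy as np

def linear_td0_update(w, phi_s, phi_s_next, reward, alpha=0.1, gamma=0.99):
    """One TD(0) step for a linear value function V(s) = w . phi(s).

    w          : weight vector (the learned parameters)
    phi_s      : feature vector of the current state (hand-designed here)
    phi_s_next : feature vector of the successor state
    """
    # Temporal-difference error: observed target minus current estimate.
    td_error = reward + gamma * (w @ phi_s_next) - (w @ phi_s)
    # Gradient step on the weights; only len(w) parameters are updated,
    # regardless of how many raw states the problem has.
    return w + alpha * td_error * phi_s

# Illustrative usage with made-up 4-dimensional features.
w = np.zeros(4)
phi_s = np.array([1.0, 0.0, 0.5, 0.0])
phi_next = np.array([0.0, 1.0, 0.0, 0.5])
w = linear_td0_update(w, phi_s, phi_next, reward=1.0)
# With w initially zero, the TD error is just the reward (1.0),
# so the update moves w in the direction of phi_s scaled by alpha.
```

Because the approximator is shared across states with similar features, a single update generalizes to every state whose feature vector resembles φ(s), which is exactly the source of both the efficiency gain and the ability to handle continuous state variables.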