A Sarsa(λ) Algorithm Based on Double-Layer Fuzzy Reasoning

Quan Liu,Yonggang Zhang,Xiang Mu,Qiming Fu,Wei Huang

doi:10.1155/2013/561026

Quan Liu, Yonggang Zhang + Show 3 more

Open Access

https://doi.org/10.1155/2013/561026

Copy DOI

Abstract

Solving reinforcement learning problems in continuous space with function approximation is currently a research hotspot of machine learning. When dealing with the continuous space problems, the classicQ-iteration algorithms based on lookup table or function approximation converge slowly and are difficult to derive a continuous policy. To overcome the above weaknesses, we propose an algorithm named DFR-Sarsa(λ) based on double-layer fuzzy reasoning and prove its convergence. In this algorithm, the first reasoning layer uses fuzzy sets of state to compute continuous actions; the second reasoning layer uses fuzzy sets of action to compute the components ofQ-value. Then, these two fuzzy layers are combined to compute theQ-value function of continuous action space. Besides, this algorithm utilizes the membership degrees of activation rules in the two fuzzy reasoning layers to update the eligibility traces. Applying DFR-Sarsa(λ) to the Mountain Car and Cart-pole Balancing problems, experimental results show that the algorithm not only can be used to get a continuous action policy, but also has a better convergence performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Mathematical Problems in Engineering	Publication Date: Jan 1, 2013
Citations: 13	License type: CC BY 3.0

R Discovery Prime

R Discovery Prime

A Sarsa(λ) Algorithm Based on Double-Layer Fuzzy Reasoning

Abstract

Talk to us

Similar Papers

More From: Mathematical Problems in Engineering

Lead the way for us

Similar Papers

Continuous-action reinforcement learning with fast policy search and adaptive basis function selection
Xin Xu ... Chunming Liu
Soft Computing | VOL. 15
Xin Xu, et. al.Xin Xu ... Chunming Liu
28 Mar 2010
Soft Computing | VOL. 15

Behavior Learning of Autonomous Agents in Continuous State Using Function Approximation
Min-Kyu Shon ... Junichi Murata
-
Min-Kyu Shon, et. al.Min-Kyu Shon ... Junichi Murata
01 Jan 2004
01 Jan 2004

Energy management of hybrid electric bus based on deep reinforcement learning in continuous state and action space
Huachun Tan ... Yuankai Wu
Energy Conversion and Management | VOL. 195
Huachun Tan, et. al.Huachun Tan ... Yuankai Wu
18 May 2019
Energy Conversion and Management | VOL. 195

Monte Carlo Tree Search in Continuous Spaces Using Voronoi Optimistic Optimization with Regret Bounds
Beomjoon Kim ... Kyungjae Lee
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
Beomjoon Kim, et. al.Beomjoon Kim ... Kyungjae Lee
03 Apr 2020
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Sarsa(λ) Algorithm Based on Double-Layer Fuzzy Reasoning

Abstract

Talk to us

Similar Papers

More From: Mathematical Problems in Engineering