Abstract
This paper presents a new Fuzzy Reinforcement Learning (FRL) algorithm based on a critic-only architecture. The proposed algorithm, called Fuzzy Sarsa Learning (FSL), tunes the parameters of the conclusion parts of the Fuzzy Inference System (FIS) online. FSL is based on Sarsa, an on-policy method that approximates the Action Value Function (AVF). In each rule, actions are selected according to the proposed modified Softmax action selection, so that the final inferred action selection probability in FSL is equivalent to the standard Softmax formula. We prove the existence of fixed points for the proposed Approximate Action Value Iteration (AAVI). We then show that FSL satisfies the necessary conditions guaranteeing the existence of its stationary points, which coincide with the fixed points of the AAVI. We prove that the weight vector of FSL with a stationary action selection policy converges to a unique value. We also compare by simulation the performance of FSL and Fuzzy Q-Learning (FQL) in terms of learning speed and action quality. Moreover, another example demonstrates the convergence of FSL and the divergence of FQL when both algorithms use a stationary policy.

Copyright © 2008 John Wiley and Sons Asia Pte Ltd and Chinese Automatic Control Society
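To make the abstract's description concrete, the following is a minimal sketch of a fuzzy Sarsa learner with per-rule softmax action selection. It assumes a zero-order Takagi-Sugeno FIS with a discrete action set per rule; all class and variable names, the credit-assignment scheme, and the parameter values are illustrative assumptions, not the paper's exact algorithm.

```python
import numpy as np

class FuzzySarsaSketch:
    """Illustrative fuzzy Sarsa learner: each rule holds q-values over a
    discrete action set; rule outputs are mixed by firing strengths."""

    def __init__(self, n_rules, n_actions, alpha=0.1, gamma=0.95, tau=1.0):
        self.q = np.zeros((n_rules, n_actions))  # per-rule action values
        self.alpha, self.gamma, self.tau = alpha, gamma, tau

    def softmax(self, values):
        # Standard softmax with temperature tau (max-shifted for stability).
        z = (values - values.max()) / self.tau
        p = np.exp(z)
        return p / p.sum()

    def select(self, phi):
        """phi: normalized rule firing strengths (nonnegative, sums to 1).
        Sample one action per rule via softmax over that rule's q-values."""
        n_rules, n_actions = self.q.shape
        choices = np.array([
            np.random.choice(n_actions, p=self.softmax(self.q[i]))
            for i in range(n_rules)
        ])
        # Inferred value of the composed action under the FIS.
        q_sa = np.dot(phi, self.q[np.arange(n_rules), choices])
        return choices, q_sa

    def update(self, phi, choices, reward, q_next):
        """On-policy Sarsa update: q_next is the inferred value of the
        action actually selected in the next state."""
        idx = np.arange(self.q.shape[0])
        q_sa = np.dot(phi, self.q[idx, choices])
        delta = reward + self.gamma * q_next - q_sa  # TD error
        # Credit each rule's chosen action in proportion to its firing.
        self.q[idx, choices] += self.alpha * delta * phi
```

A typical loop would call `select` in the current state, act, observe the reward, call `select` again in the next state to obtain `q_next`, and then call `update`; because the target uses the action actually chosen by the softmax policy, the update is on-policy, matching the Sarsa basis described above.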