Continuous and discretized pursuit learning schemes: various algorithms and their comparison

B.J Oommen,M Agache

doi:10.1109/3477.931507

Abstract

A learning automaton (LA) is an automaton that interacts with a random environment, having as its goal the task of learning the optimal action based on its acquired experience. Many learning automata (LAs) have been proposed, with the class of estimator algorithms being among the fastest ones, Thathachar and Sastry, through the pursuit algorithm, introduced the concept of learning algorithms that pursue the current optimal action, following a reward-penalty learning philosophy. Later, Oommen and Lanctot extended the pursuit algorithm into the discretized world by presenting the discretized pursuit algorithm, based on a reward-inaction learning philosophy. In this paper we argue that the reward-penalty and reward-inaction learning paradigms in conjunction with the continuous and discrete models of computation, lead to four versions of pursuit learning automata. We contend that a scheme that merges the pursuit concept with the most recent response of the environment, permits the algorithm to utilize the LAs long-term and short-term perspectives of the environment. In this paper, we present all four resultant pursuit algorithms, prove the E-optimality of the newly introduced algorithms, and present a quantitative comparison between them.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Continuous and discretized pursuit learning schemes: various algorithms and their comparison

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics)

Lead the way for us

Journal: IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics)	Publication Date: Jun 1, 2001
Citations: 144

Similar Papers

A comparison of continuous and discretized pursuit learning schemes
B.J Oommen ... M Agache
-
B.J Oommen, et. al.B.J Oommen ... M Agache
12 Oct 1999
12 Oct 1999

A new class of learning automata for selecting an optimal subset
Junqi Zhang ... Qi Kang
-
Junqi Zhang, et. al.Junqi Zhang ... Qi Kang
01 Oct 2014
01 Oct 2014

A Comprehensive Survey of Estimator Learning Automata and Their Recent Convergence Results
B John Oommen ... Xuan Zhang
-
B John Oommen, et. al.B John Oommen ... Xuan Zhang
01 Jan 2021
01 Jan 2021

On Using the Theory of Regular Functions to Prove the ε-Optimality of the Continuous Pursuit Learning Automaton
Xuan Zhang ... Ole-Christoffer Granmo
-
Xuan Zhang, et. al.Xuan Zhang ... Ole-Christoffer Granmo
01 Jan 2013
01 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Continuous and discretized pursuit learning schemes: various algorithms and their comparison

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics)