Abstract

In the field of artificial intelligence, learning automaton (LA) is a self-adaptive decision-maker which plays an important role in reinforcement learning (RL). Games of learning automata are stochastic games with incomplete information that have received frequent usage. Traditional learning automata schemes using in games are parameter-based schemes which exist a tunable parameter (stepsize) changing with different environments. In this paper, we proposed Bayesian method-based parameter-free learning automata (BPFLA) for two-player stochastic games with incomplete information. The parameter-free property indicates that a set of parameters in the scheme can be universally applicable for all configurations of games. Besides, simulation results demonstrate that BPFLA has much faster convergence rate than traditional schemes using games of learning automata with equal or higher accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.