Online Adaptable Learning Rates for the Game Connect-4

Samineh Bagheri,Patrick Koch,Markus Thill,Wolfgang Konen

doi:10.1109/tciaig.2014.2367105

Abstract

Learning board games by self-play has a long tradition in computational intelligence for games. Based on Tesauro's seminal success with TD-Gammon in 1994, many successful agents use temporal difference learning today. But in order to be successful with temporal difference learning on game tasks, often a careful selection of features and a large number of training games is necessary. Even for board games of moderate complexity like Connect-4, we found in previous work that a very rich initial feature set and several millions of game plays are required. In this work we investigate different approaches of online-adaptable learning rates like Incremental Delta Bar Delta (IDBD) or temporal coherence learning (TCL) whether they have the potential to speed up learning for such a complex task. We propose a new variant of TCL with geometric step size changes. We compare those algorithms with several other state-of-the-art learning rate adaptation algorithms and perform a case study on the sensitivity with respect to their meta parameters. We show that in this set of learning algorithms those with geometric step size changes outperform those other algorithms with constant step size changes. Algorithms with nonlinear output functions are slightly better than linear ones. Algorithms with geometric step size changes learn faster by a factor of 4 as compared to previously published results on the task Connect-4.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Computational Intelligence and AI in Games	Publication Date: Mar 1, 2016
Citations: 41	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Online Adaptable Learning Rates for the Game Connect-4

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computational Intelligence and AI in Games

Lead the way for us

Similar Papers

Improving the accuracy of BDF methods for index 3 differential-algebraic equations
Carmen Arévalo ... Per Lötstedt
BIT Numerical Mathematics | VOL. 35
Carmen Arévalo, et. al.Carmen Arévalo ... Per Lötstedt
01 Sep 1995
BIT Numerical Mathematics | VOL. 35

Preservation of the growth rates of delay differential equations by Euler schemes with non-uniform step sizes
John A.D Appleby ... Michael J Mccarthy
Computers & Mathematics with Applications | VOL. 64
John A.D Appleby, et. al.John A.D Appleby ... Michael J Mccarthy
21 Feb 2012
Computers & Mathematics with Applications | VOL. 64

On the zero-stability of multistep methods on smooth nonuniform grids
Gustaf Söderlind ... István Faragó
BIT Numerical Mathematics | VOL. 58
Gustaf Söderlind, et. al.Gustaf Söderlind ... István Faragó
10 Jul 2018
BIT Numerical Mathematics | VOL. 58

Optimally zero stable explicit peer methods with variable nodes
Marcel Klinge ... Helmut Podhaisky
BIT Numerical Mathematics | VOL. 58
Marcel Klinge, et. al.Marcel Klinge ... Helmut Podhaisky
22 Nov 2017
BIT Numerical Mathematics | VOL. 58

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Online Adaptable Learning Rates for the Game Connect-4

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computational Intelligence and AI in Games