Evolution and Incremental Learning in the Iterated Prisoner's Dilemma

Han-Yang Quek, Kay Chen Tan, Chi-Keong Goh, H.A. Abbass

doi:10.1109/tevc.2008.2003009

Abstract

This paper examines the comparative performance and adaptability of evolutionary, learning, and memetic strategies under different environment settings in the iterated prisoner's dilemma (IPD). A memetic adaptation framework is developed for IPD strategies to exploit the complementary features of evolution and learning. In this paradigm, learning serves as a form of directed search that guides evolving strategies toward eventual convergence on good strategy traits, while evolution helps to minimize disparity in performance among learning strategies. Furthermore, a double-loop incremental learning scheme (ILS) that incorporates a classification component, probabilistic update of strategies, and a feedback learning mechanism is proposed and incorporated into the evolutionary process. Simulation results verify that the two techniques, when employed together, complement each other's strengths and compensate for each other's weaknesses, leading to the formation of strategies that adapt and thrive in complex, dynamic environments.

Full Text