Zeroth-level Classifier System Research Articles

Learning Classifier Systems (LCSs) are rule-based adaptive systems that have both Reinforcement Learning (RL) and rule-discovery mechanisms for effective and practical on-line learning. With the aim of establishing a common theoretical basis between LCSs and RL algorithms to share each field's findings, a detailed analysis was performed to compare the learning processes of these two approaches. Based on our previous work on deriving an equivalence between the Zeroth-level Classifier System (ZCS) and Q-learning with Function Approximation (FA), this paper extends the analysis to the influence of actually applying the conditions for this equivalence. Comparative experiments have revealed interesting implications: (1) ZCS's original parameter, the deduction rate, plays a role in stabilizing the action selection, but (2) from the Reinforcement Learning perspective, such a process inhibits the ability to accurately estimate values for the entire state-action space, thus limiting the performance of ZCS in problems requiring accurate value estimation.

Read full abstract

In a recent article, Wilson (1994) described a "zeroth-level" classifier system (ZCS). ZCS employs a reinforcement learning technique comparable to Q-learning (Watkins, 1989). This article presents results from the first reconstruction of ZCS. Having replicated Wilson's results, we extend ZCS in a manner suggested by Wilson: The original formulation of ZCS has no memory mechanisms, but Wilson (1994b) suggested how internal "temporary memory" registers could be added. We show results from adding one-bit and two-bit memory registers to ZCS. Our results demonstrate that ZCS can exploit memory facilities efficiently in non-Markov environments. We also show that the memoryless ZCS can converge on near-optimal stochastic solutions in non-Markov environments. We then present results from trials using ZCS in Markov environments that require increasingly long chains of actions before reward is received. Our results indicate that inaccurate overgeneral classifiers can interact with the classifier-generation mechanisms to cause catastrophic breakdowns in overall system performance. Basing classifier fitness on accuracy may alleviate this problem. We conclude that the memory mechanism in its current form is unlikely to scale well for situations requiring large amounts of temporary memory. Nevertheless, the ability to find stochastic solutions when there is insufficient memory might offset this problem somewhat.

Read full abstract

Zeroth-level Classifier System Research Articles

Articles published on Zeroth-level Classifier System

Improving selection strategies in zeroth-level classifier systems based on average reward reinforcement learning

Tournament Selection in Zeroth-Level Classifier Systems Based On Average Reward Reinforcement Learning

Analyzing Strength-Based Classifier System from Reinforcement Learning Perspective

분류자 시스템을 이용한 축구 로봇의 행동 전략

An accuracy based corporate classifier system

Self-adaptation in classifier system controllers

Adding Temporary Memory to ZCS

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Zeroth-level Classifier System Research Articles

Articles published on Zeroth-level Classifier System

Improving selection strategies in zeroth-level classifier systems based on average reward reinforcement learning

Tournament Selection in Zeroth-Level Classifier Systems Based On Average Reward Reinforcement Learning

Analyzing Strength-Based Classifier System from Reinforcement Learning Perspective

분류자 시스템을 이용한 축구 로봇의 행동 전략

An accuracy based corporate classifier system

Self-adaptation in classifier system controllers

Adding Temporary Memory to ZCS