Abstract
Reinforcement learning has recently received much attention as a learning method (Sutton, 1988; Watkins & Dayan, 1992). It requires no a priori knowledge and supports reactive and adaptive behavior. However, there are significant obstacles to applying it to real-world problems, among them the high cost of learning and the large size of the action-state space. Q-learning (Watkins & Dayan, 1992), known as one of the most effective reinforcement learning methods, has difficulty accomplishing learning tasks when the action-state space is large. The application of conventional Q-learning is therefore restricted to simple tasks with small action-state spaces. Because of the large action-state space, it is difficult to apply Q-learning directly to real problems such as the control of robots with many redundant degrees of freedom. To cope with this difficulty, various algorithms for structuring and dividing the action-state space have been proposed (Holland, 1986; Svinin et al., 2001; Yamada et al., 2001). In the dividing algorithms, the state space is divided dynamically while the action space remains fixed, so they cannot be applied to tasks with a large action space. In classifier systems, a "don't care" attribute is introduced to create general rules, but this causes a partial observability problem; furthermore, an ensemble of general and special rules must be prepared in advance. Considering these points, Ito & Matsuno (2002) proposed a GA-based Q-learning method called "Q-learning with Dynamic Structuring of Exploration Space Based on Genetic Algorithm (QDSEGA)." In their algorithm, a genetic algorithm is employed to restructure the action-state space that is then learned by Q-learning; that is, the genetic algorithm reduces the size of the action-state space so that Q-learning can be applied to the learning process on that space. They applied their algorithm to the control of a multi-legged robot, which has many redundant degrees of freedom and a large action-state space, and by applying their restriction method for the action-state space they successfully obtained control rules for the robot with QDSEGA. However, the way the genetic algorithm is applied in their study is rather straightforward. We therefore propose a crossover operator and a modified fitness definition for QDSEGA (Murata & Yamaguchi, 2005; Murata & Aoki,
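For reference, a minimal sketch of the tabular Q-learning update the abstract refers to (Watkins & Dayan, 1992); the symbols α (learning rate), γ (discount factor), r (reward), and s' (next state) follow the usual conventions and are not notation taken from this paper:

Q(s, a) ← Q(s, a) + α [ r + γ max_{a'} Q(s', a') - Q(s, a) ]

Because one value is stored for every state-action pair, the table grows with |S| × |A|, which is the scaling problem that motivates restructuring the action-state space before learning.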