A proposition of adaptive state space partition in reinforcement learning with Voronoi tessellation

Takayasu Fuchida,Kathy Thi Aung

doi:10.1007/s10015-013-0125-x

Abstract

This paper presents a new adaptive segmentation of continuous state space based on vector quantization algorithm such as Linde---Buzo---Gray for high-dimensional continuous state spaces. The objective of adaptive state space partitioning is to develop the efficiency of learning reward values with an accumulation of state transition vector in a single-agent environment. We constructed our single-agent model in continuous state and discrete actions spaces using Q-learning function. Moreover, the study of the resulting state space partition reveals a Voronoi tessellation. In addition, the experimental results show that this proposed method can partition the continuous state space appropriately into Voronoi regions according to not only the number of actions, but also achieve a good performance of reward-based learning tasks compared with other approaches such as square partition lattice on discrete state space.

Full Text