Abstract

Reinforcement learning is popular machine learning techniques for optimal planning in complex environment. The maze is a complex environment which has a grid made of an arbitrary number of squares of width and length where finding optimal path, which converge in minimum time, is always a challenging task. There are various reinforcement learning methods where agent learn from environment to find optimal path in maze problems viz. discrete Q-Learning, Dyna-CA Learning and FRIQ-Learning (Fuzzy Rule Interpolation-based Q-Learning). This research intends to carry out a comparative study of these three methods to locate a method with best convergence time. The algorithms pertaining to these methods are tested on MATLAB computational platform for different obstacles configurations of maze to compare their real time parameter of convergence time. The performance results were analyzed and presented. The final result reveals that FRIQ-Learning outperforms the others under all conditions. Keywords: Reinforcement learning, maze environment, Q-Learning, Dyna-CA Learning and FRIQ-Learning.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.