Omnidirectional-Wheel Conveyor Path Planning and Sorting Using Reinforcement Learning Algorithms

Walid Zaher,Arsany W Youssef,Eman Azab,Lamia A Shihata,Maggie Mashaly

doi:10.1109/access.2022.3156924

Walid Zaher, Arsany W Youssef + Show 3 more

Open Access

https://doi.org/10.1109/access.2022.3156924

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 13	License type: CC BY-NC-ND 4.0

Affiliation: German University in Cairo

Abstract

In this paper, path planning and sorting of packages for Omnidirectional-Wheel conveyor are presented using Reinforcement Learning (RL). Q-learning, Double Q-learning, Deep Q-learning, and the Double Deep Q-learning algorithms are investigated. The RL algorithms enable the conveyor to self-learn the packages path and sort them without using conventional control or path planning theories. The RL algorithms are used for two different case studies on conveyors structures with different numbers of cells to compare and evaluate their performances in large- and small-scale sized structures. To explore the proposed methods response to external environment effects, two types of collisions between multiple packages were considered, the proposed RL algorithms showed their ability to resolve both types successfully. Comparative study between multiple RL algorithms for path planning showed that the Q-learning and Double Q-learning algorithms had outperformed their Deep learning versions for path planning in the two case studies. Furthermore, the proposed RL methods are compared experimentally to classic control and path planning theories using a hardware prototype for one of the presented case studies. The hardware experimental results showed that the proposed RL methods were as successful as the conventional methods in path planning and sorting in much less processing time. Two types of sorting scenarios (Type I and II) were tested for same package type and for multiple ones. For Type I sorting the Q-learning algorithm performed better than the Q-learning with weights approach, achieving better mean and minimum rewards while maximum rewards remain the same for both techniques. As for Type II sorting, only the Q-learning with weights approach was able to achieve it and converge in a reasonable time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Omnidirectional-Wheel Conveyor Path Planning and Sorting Using Reinforcement Learning Algorithms

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Omnidirectional-Wheel Conveyor Path Planning and Sorting Using Reinforcement Learning Algorithms

Abstract

Talk to us

Similar Papers

More From: IEEE Access