Application of Deep Reinforcement Learning for Tracking Control of 3WD Omnidirectional Mobile Robot

Atif Mehmood,Ahsan Ali,Inam Ul Hasan Shaikh

doi:10.5755/j01.itc.50.3.25979

Abstract

Deep reinforcement learning, the fastest growing technique, to solve real-world complex problems by creatinga simple mathematical framework. It includes an agent, action, environment, and a reward. An agent will interactwith the environment, takes an optimal action aiming to maximize the total reward. This paper proposesthe compelling technique of deep deterministic policy gradient for solving the complex continuous actionspace of 3-wheeled omnidirectional mobile robots. Three-wheeled Omnidirectional mobile robots tracking isa difficult task because of the orientation of the wheels which makes it rotate around its own axis rather tofollow the trajectory. A deep deterministic policy gradient (DDPG) algorithm has been designed to train in environmentswith continuous action space to follow the trajectory by training the neural networks defined forthe policy and value function to maximize the reward function defined for the tracking of the trajectory. DDPGagent environment is created in the Reinforcement learning toolbox in MATLAB 2019 while for Actor and criticnetwork design deep neural network designer is used. Results are shown to illustrate the effectiveness of thetechnique with a convergence of error approximately to zero.

Highlights

Wheeled mobile robots have many advantages compared to their legged counterparts such as structural simplicity, energy efficiency, high locomotion speed, and low cost of manufacturing
Reinforcement learning is a recent and much powerful approach that can be used for wheeled mobile robots, as it enables us to find an optimal solution to a
Simulation for the validation of the results has been done in MATLAB 19 and the Reinforcement learning toolbox is used for environment creation, actor-critic networks, agent, and training of that agent

Summary

Introduction

Wheeled mobile robots have many advantages compared to their legged counterparts such as structural simplicity, energy efficiency, high locomotion speed, and low cost of manufacturing. One of the types of a wheeled mobile robot is holonomic wheeled mobile robots which can be designed to move in any direction without changing its orientation The control hierarchy of wheeled mobile robots is often categorized as high-level and low-level. Reinforcement learning is a recent and much powerful approach that can be used for wheeled mobile robots, as it enables us to find an optimal solution to a wwidereescaolrpeeaodfycodmisptalenxt dfoecrisaionm-amcahkinineg. Observation – Any value that can be measure and visible to an agent. S∈S develop an agent that can observe outputs, refer-and Policy-based (stochastic policy) [17]

Objectives

Results

Conclusion