Abstract

Target tracking is a process that may find applications in different domains such as video surveillance, robot navigation and human computer interaction. In this work we have considered the problem of tracking a moving object in a multi agent environment. The environment is a rectangular space bounded by walls. The first agent is the target and it moves randomly in the space. The second agent should follow the target, keeping as close as possible without crashing with it. It uses sensors to detect the position of the target. The sensor readings give the distance and the angle from the target. We use reinforcement learning to train the tracker to detect any change in the movement of the target and stay within a certain range from it. Reinforcement learning is a form of machine learning in which the agent learns by interacting with the environment. By doing so, for each action taken, the agent receives a reward from the environment, which is used to determine positive or negative behaviour. The goal of the agent is to maximise the total reward received during the interaction. This form of machine learning has applications in different areas, such as: game solving with the most known game being AlphaGO; robotics, for design of hard-to engineer behaviours; traffic light control, personalized recommendations, etc. The sensor readings may have continuous values, making a very large state space. We approximate the value function using neural networks and use different reward functions for learning the best policy.

Highlights

  • Target tracking is a process that may find applications in different domains such as video surveillance, robot navigation and human computer interaction

  • Reinforcement learning is a form of machine learning that is based on learning from experience

  • This paper develops a new paradigm for solving the problem of visual tracking, by using recurrent neural networks and reinforcement learning in order to exploit temporal correlation in videos. [11] solved the problem of multi object tracking (MOT) using collaborative deep reinforcement learning

Read more

Summary

INTRODUCTION1

Object tracking is an area that has many applications in different domains, some of them being human computer interaction, video surveillance and robot navigation. Some of the tasks a robot can perform may include rescue operation, disaster relief, patrolling, autonomous navigation, personal assistants, surgical assistant etc All these tasks may require some form of object tracking, having a target that needs to be followed, for example a personal assistant that follows you carrying your bag. Machine learning has become a very strong tool for solving different types of problems and even surpassing the humans in certain areas. Reinforcement learning is a form of machine learning that is based on learning from experience. Since reinforcement learning requires some form of reward to be designed in order to orient the learner goals, we will try different rewards and will see how they affect the result. In part 5 we describe the simulation that we have done, the environment and the experiment, and in part 6 give results and conclusions gathered from this work

LITERATURE REVIEW
General Presentation
On-Policy and Off-Policy Learning
TEMPORAL DIFFERENCE LEARNING
On-Policy TD Control with Sarsa
Neural Network Implementation Approach
Neural Fitted Q-Learning
Deep Q-network
THE PROBLEM
The Framework Used
Proposed Solution
Results
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call