Abstract

In video surveillance, person tracking is considered as challenging task. Numerous computer vision, machine and deep learning–based techniques have been developed in recent years. Majority of these techniques are based on frontal view images/video sequences. The advancement of convolutional neural network reforms the way of object tracking. The network layers of convolutional neural network models trained on a number of images or video sequences improve speed and accuracy of object tracking. In this work, the generalization performance of existing pre-trained deep learning models have investigated for overhead view person detection and tracking, under different experimental conditions. The object tracking method Generic Object Tracking Using Regression Networks (GOTURN) which has been yielding outstanding tracking results in recent years is explored for person tracking using overhead views. This work mainly focused on overhead view person tracking using Faster region convolutional neural network (Faster-RCNN) in combination with GOTURN architecture. In this way, the person is first identified in overhead view video sequences and then tracked using a GOTURN tracking algorithm. Faster-RCNN detection model achieved the true detection rate ranging from 90% to 93% with a minimum false detection rate up to 0.5%. The GOTURN tracking algorithm achieved similar results with the success rate ranging from 90% to 94%. Finally, the discussion is made on output results along with future direction.

Highlights

  • Nowadays, convolutional neural network (CNN)-based models achieve remarkable success, in the area of pattern recognition, image processing, remote sensing, data classification, computer vision, and smart surveillance analysis

  • Performing overhead view person tracking using Generic Object Tracking Using Regression Networks (GOTURN) tracking algorithm combined with the Faster-RCNN detection model

  • For person tracking, the Faster-RCNN detection model is combined with GOTURN tracking algorithm

Read more

Summary

Introduction

Convolutional neural network (CNN)-based models achieve remarkable success, in the area of pattern recognition, image processing, remote sensing, data classification, computer vision, and smart surveillance analysis ( in object detection, tracking, and recognition). To detect or identify the object (person), Faster region convolutional neural network (Faster-RCNN)[12] is combined with a GOTURN tracking algorithm In this way, the object (person) is first detected using Faster-RCNN and tracked using the GOTURN algorithm in overhead view video sequences. The GOTURN and Faster-RCNN models were pre-trained using normal frontal view data set, while for testing purpose, the overhead view person data set is used. The present article mainly focuses on the following: Performing overhead view person tracking using GOTURN tracking algorithm combined with the Faster-RCNN detection model. Overhead view person data set is used, containing video sequences having variation in person appearance (including a variety of poses, shapes, and scales of person) and different camera resolutions with indoor and outdoor backgrounds. Detailed explanation of output results and performance evaluation is provided in the ‘‘Experimental results’’ section, and the ‘‘Conclusion’’ section concludes the discussed work

Literature review
Experimental results
Declaration of conflicting interests
Findings
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.