Abstract

Visual tracking in aerial videos is a challenging task in computer vision and remote sensing because of appearance variations, which are caused by camera and target motion, low-resolution and noisy images, scale changes, and pose variations. Various approaches have been proposed to handle appearance variations in aerial videos; among them, the spatiotemporal saliency detection approach has reported promising results for moving-target detection. However, it is not accurate for moving-target detection when visual tracking is performed under appearance variations. In this study, a visual tracking method based on spatiotemporal saliency and discriminative online learning is proposed to deal with appearance variation difficulties. Temporal saliency represents the moving target regions and is extracted from frame differences binarised with the Sauvola local adaptive thresholding algorithm. Spatial saliency represents the target appearance details within the candidate moving regions: SLIC superpixel segmentation together with color and moment features is used to compute the feature-uniqueness and spatial-compactness saliency measures. Because this computation is time consuming, a parallel algorithm is developed to distribute the saliency detection processes across multiple processors. The temporal and spatial saliencies are then combined into a spatiotemporal saliency map that represents the moving targets. Finally, a discriminative online learning algorithm generates a sample model from the spatiotemporal saliency, and this model is updated incrementally to detect the target under appearance variation conditions. Experiments on the VIVID dataset demonstrate that the proposed method is effective and computationally efficient compared with state-of-the-art methods.
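The temporal-saliency step described above (frame differencing binarised with Sauvola's local adaptive threshold) can be sketched in a few lines of Python with scikit-image. This is a minimal illustration, not the paper's implementation; the window size and `k` parameter are illustrative defaults.

```python
import numpy as np
from skimage.filters import threshold_sauvola

def temporal_saliency(prev_frame, curr_frame, window_size=15, k=0.2):
    """Binary map of candidate moving regions: the absolute frame
    difference is binarised with Sauvola's local adaptive threshold.
    window_size and k are illustrative defaults, not the paper's values."""
    diff = np.abs(curr_frame.astype(np.float64) - prev_frame.astype(np.float64))
    thresh = threshold_sauvola(diff, window_size=window_size, k=k)
    return diff > thresh
```

A local threshold adapts to the background level around each pixel, which is why it is preferred here over a single global threshold on noisy aerial frames.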

Highlights

  • Visual tracking is an active research topic in computer vision

  • This paper focuses on spatiotemporal saliency detection to deal with appearance variation difficulties in aerial videos, including a proposed spatial saliency detection method for visual target representation

  • The videos are collected from the VIVID dataset [46] and exhibit appearance variation difficulties, such as complicated backgrounds, illumination changes, scale changes, and pose variations
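
The spatial saliency detection mentioned in the highlights can be illustrated with a simplified sketch: SLIC superpixels followed by a colour-uniqueness score down-weighted by spatial distance. This is only a schematic reading of the approach; the superpixel count and weighting are assumed values, and the paper's moment features and spatial-compactness measure are omitted for brevity.

```python
import numpy as np
from skimage.segmentation import slic

def spatial_saliency(image, n_segments=100, sigma=0.5):
    """Per-pixel saliency from superpixel colour uniqueness: superpixels
    whose mean colour differs from spatially nearby superpixels score
    higher. Parameters are illustrative, not the paper's values."""
    labels = slic(image, n_segments=n_segments, start_label=0)
    n = labels.max() + 1
    h, w = labels.shape
    ys, xs = np.mgrid[0:h, 0:w]
    flat = labels.ravel()
    counts = np.maximum(np.bincount(flat, minlength=n).astype(float), 1.0)
    # mean colour and (normalised) mean position of each superpixel
    mean_col = np.stack(
        [np.bincount(flat, image[..., c].ravel(), n) for c in range(3)], axis=1
    ) / counts[:, None]
    mean_pos = np.stack(
        [np.bincount(flat, ys.ravel(), n), np.bincount(flat, xs.ravel(), n)], axis=1
    ) / counts[:, None]
    mean_pos /= max(h, w)
    # uniqueness: colour distance to the other superpixels,
    # weighted by a Gaussian on spatial distance
    col_d = np.linalg.norm(mean_col[:, None] - mean_col[None], axis=2)
    pos_d = np.linalg.norm(mean_pos[:, None] - mean_pos[None], axis=2)
    wgt = np.exp(-pos_d ** 2 / (2 * sigma ** 2))
    wgt /= wgt.sum(axis=1, keepdims=True)
    uniq = (wgt * col_d).sum(axis=1)
    uniq = (uniq - uniq.min()) / (np.ptp(uniq) + 1e-12)
    return uniq[labels]
```

A small, distinctly coloured region ends up with high uniqueness because most of its weighted colour distances are large, whereas background superpixels mostly compare against similar colours.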


Introduction

Visual tracking is an active research topic in computer vision. It has been used in many applications, such as activity recognition, surveillance, robotics, and human-computer interaction [1]. Visual tracking algorithms and systems often fail on aerial videos. The sources of this failure include appearance variations in the target image caused by relative camera and target motion, inadequate spatial resolution or noise, scale changes, and pose variations [3,4,5]. An efficient visual representation is crucial for describing the target in the scene and generating a sample model [4,8]. The proposed method is able to detect moving targets efficiently in noisy backgrounds and under long-term occlusions. A relative distance change (RDC) measure is proposed to distinguish the target from the background scene; it is invariant to image rotation, translation, and scaling. The proposed method is detailed in the following subsections.
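The RDC measure itself is not defined in this excerpt. As a hedged illustration of how a distance-based measure can be made invariant to rotation, translation, and scaling, one hypothetical construction normalises all pairwise distances of a point set by the largest one; the function name and formulation below are assumptions, not the paper's definition.

```python
import numpy as np

def distance_ratio_signature(points):
    """Hypothetical similarity-invariant signature (NOT the paper's RDC):
    every pairwise distance divided by the largest one. Rigid motions
    preserve distances and uniform scaling cancels in the ratio, so the
    sorted ratios are invariant to rotation, translation, and scaling."""
    pts = np.asarray(points, dtype=float)
    i, j = np.triu_indices(len(pts), k=1)
    d = np.linalg.norm(pts[i] - pts[j], axis=1)
    return np.sort(d / d.max())
```

Comparing such signatures between a target template and a candidate region would not change under camera rotation, translation, or zoom, which is the kind of invariance the introduction attributes to RDC.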
