Abstract

With the recent development of earth observation technology, the satellites have obtained the ability to capture city-scale videos, which enable potential applications in intelligent traffic management. Because of the broad field-of-view, the moving vehicles in satellite videos are usually composed of only tens of pixels, making it difficult to differentiate true objects from noise and other distractors. In addition, the edges of tall building tops are often mistakenly detected as moving vehicles because of the effects of motion parallax. This article proposed a terse framework that can effectively suppress false targets, achieving high precision and recall. The study involves three parts: 1) An adaptive filtering method is proposed to reduce noise, thus making the detection algorithm more reliable; 2) Several background subtraction models are tested, and the best one is chosen to produce the preliminary detection results at high recall but low accuracy; 3) A lightweight convolutional neural network (LCNN) is designed and trained on a small collection of samples, and then used to eliminate false targets. The experiments and evaluations demonstrate that our method can largely improve the precision at the expense of a slight reduction of recall.

Highlights

  • I N recent years, commercial satellite technology has achieved significant development in capturing highresolution videos

  • We proposed a detection framework based on background subtraction modeling and convolutional neural network (CNN)

  • 2) Evaluation of the lightweight CNN (LCNN) Model The previous experiments suggest that K nearest neighbor (KNN) is the best background subtraction model for our task

Read more

Summary

Introduction

I N recent years, commercial satellite technology has achieved significant development in capturing highresolution videos. In 2015, Chang Guang Satellite Technology Co. Ltd (China), known as CGSTL, launched two video satellites Jilin and Jilin, which could acquire color videos. The background subtraction model (BGM) is a widely used approach for detecting moving objects from videos. Several background subtraction models have been proposed and widely used in ground surveillance systems. The GMM-based model has been widely used for foreground detection and recognized as one of the best models for video surveillance systems [20]. A recently proposed background model, called ViBe (Visual Background Extractor) [21], is considered a novel method better than GMM. Compared to GMM, ViBe can shorten modeling time and performs well under changing background conditions

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call