MS-Faster R-CNN: Multi-Stream Backbone for Improved Faster R-CNN Object Detection and Aerial Tracking from UAV Images

Danilo Avola, Luigi Cinque, Daniele Pannone, Claudio Piciarelli, Gian Luca Foresti, Alessio Fagioli, Alessio Mecca, Anxhelo Diko

doi:10.3390/rs13091670


Open Access

https://doi.org/10.3390/rs13091670


Journal: Remote Sensing	Publication Date: Apr 25, 2021
Citations: 52	License type: CC BY 4.0

Affiliation: Sapienza University of Rome, University of Udine

Abstract

Tracking objects across multiple video frames is a challenging task because of issues such as occlusions, background clutter, lighting changes, and object and camera viewpoint variations, all of which directly affect object detection. These issues are even more pronounced when analyzing unmanned aerial vehicle (UAV) images, where the vehicle's movement can also degrade image quality. A common strategy for addressing them is to analyze the input images at different scales, extracting as much information as possible to correctly detect and track objects across video sequences. Following this rationale, in this paper we introduce a simple yet effective novel multi-stream (MS) architecture in which a different kernel size is applied in each stream to simulate multi-scale image analysis. The proposed architecture is then used as the backbone of the well-known Faster R-CNN pipeline, defining an MS-Faster R-CNN object detector that consistently detects objects in video sequences. Subsequently, this detector is used jointly with the Simple Online and Real-time Tracking with a Deep Association Metric (Deep SORT) algorithm to achieve real-time tracking on UAV images. To assess the presented architecture, extensive experiments were performed on the UMCD, UAVDT, UAV20L, and UAV123 datasets. The presented pipeline achieved state-of-the-art performance, confirming that the proposed multi-stream method can correctly emulate the robust multi-scale image analysis paradigm.
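
The multi-stream idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' backbone: the abstract only states that each stream uses a different kernel size. The number of streams, the kernel sizes (3, 5, 7), the use of fixed mean filters in place of learned convolution weights, and the fusion by stacking stream outputs as channels are all assumptions made here for illustration, as are the function names `conv2d_same` and `multi_stream_features`.

```python
from typing import List, Sequence

Image = List[List[float]]


def conv2d_same(image: Image, k: int) -> Image:
    """Apply a k x k mean filter with zero padding, keeping the input size.

    A fixed mean filter stands in for a learned convolution kernel
    (assumption for illustration only).
    """
    h, w = len(image), len(image[0])
    r = k // 2  # kernel radius; k is expected to be odd
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for dy in range(-r, r + 1):
                for dx in range(-r, r + 1):
                    yy, xx = y + dy, x + dx
                    # Pixels outside the image contribute zero (zero padding).
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += image[yy][xx]
            out[y][x] = acc / (k * k)
    return out


def multi_stream_features(
    image: Image, kernel_sizes: Sequence[int] = (3, 5, 7)
) -> List[Image]:
    """Run one stream per kernel size and stack the outputs as channels.

    The kernel sizes and the stacking fusion are hypothetical choices.
    """
    return [conv2d_same(image, k) for k in kernel_sizes]


if __name__ == "__main__":
    # A single bright pixel in the middle of a 9 x 9 image.
    img = [[0.0] * 9 for _ in range(9)]
    img[4][4] = 1.0
    for k, channel in zip((3, 5, 7), multi_stream_features(img)):
        nonzero = sum(1 for row in channel for v in row if v > 0.0)
        print(f"kernel {k}x{k}: {nonzero} pixels respond")
```

Each stream sees the same input but responds over a neighborhood of a different size, which is the sense in which different kernel sizes can stand in for analyzing the image at several scales. Because all streams preserve the spatial size, their outputs can be combined without resizing.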

Highlights

The data used for this work were taken from four well-known benchmarks in unmanned aerial vehicle (UAV) object detection and tracking, namely UMCD [40], UAVDT [38], UAV123 [39], and UAV20L [39].
The data are rich in task-specific attributes that make it possible to experiment under different conditions, such as different altitudes, occlusion, camera motion, background clutter, and more.
The detection and tracking of an object within the environment are strongly influenced by the UAV flight height, especially if it changes continuously during acquisition.

Summary

Introduction

Computer vision has been applied to several heterogeneous tasks, including rehabilitation [1,2,3,4,5], virtual/augmented reality [6,7,8,9,10], deception detection [11,12,13,14], robotics [15,16,17,18,19], and much more. Focusing on robotics, one of the most prominent applications involves the use of drones (hereinafter, UAVs).



