Human Detection Under UAV: an Improved Faster R-CNN Approach

Yayun Qi,Hanshan Zhu,Ning Li,Haochen Shi,Huiyu Zhou

doi:10.1109/icsai.2018.8599511

Abstract

Although Faster R-CNN has excellent performance in object detection, it still has some difficulties in detecting small targets and slightly overlapped targets in UAV (Unmanned Aerial Vehicle) images. Based on Faster R-CNN, this paper uses ResNet101 as a feature extractor. We increase the number of anchors from 9 to 15 in RPN so that the small targets can match more anchors and get sufficient training. Due to the increasement of anchors, this paper introduces a 1$\times$1 convolution layer to integrate features and reduce the feature map channels. We also apply RoIAlign to avoid the misalignment caused by RoIPool. The improved model effectively increases the detection rate of small targets and slightly overlapped targets so that it can be applied to human detection under UAV. The improved model can detect small targets with a size of about 30$\times$80 pixels on aerial images with resolution of 3840$\times$2160 pixels. Compared with Faster R-CNN, the improved model increases AP (Average Precision) from 74.31% to 79.77% on the WILDTRACK dataset.

Full Text