Abstract

As a kind of single-stage detection network, RetinaNet has excellent performance in object detection, but it still has some difficulties in small object detection in unmanned aerial vehicle (UAV) images. Based on RetinaNet, this paper uses improved ResNet50 as backbone to extract features. We increase the number of anchor ratios, but reduce the number of anchor scales so that the number of anchors won't increase too much. We add the shallower features to predict smaller targets. We also add Receptive Field Block after each predicted feature to increase the receptive field. We have changed the values of parameters in the loss function for better detection performance. The improved model effectively increases the detection rate of small targets so that it can be applied to pedestrian detection under UAV. In our UAV pedestrian dataset, the improved model increases average precision (AP) from 51.29% to 84.05%.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call