Abstract

ABSTRACT Object detection in aerial images has become a focus in recent years due to the expansion of its application fields. Aerial objects have the characteristics of multi-scale, arbitrary angle and dense arrangement, which brings considerable challenges to the task. Based on the efficient one-stage detectors, most methods design modules to adaptively align the feature receptive field with the anchors to improve the accuracy of the bounding box regression and calculate loss according to parameter bias. Current methods are not adequately data-driven in feature optimization and the angle parameter regression faces boundary problems. To handle these problems, in this letter, we propose an Align-Focus detector (AFDet) for aerial image object detection. We introduce the ideology of deformable multi-head self-attention to feature optimization and design a new Align-Focus Module (AFM) which can sensitively focus feature encoding response to the real texture area of the objects. In addition, we apply KF-IOU Loss to solve the boundary problem. We conduct necessary experiments on DOTA and HRSC2016 datasets, and AFDet achieves the highest mAP performance compared to the existing methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call