Abstract

Most anchor-free methods perform object detection using dense recommendation, which assumes that one point can simultaneously conduct accurate category prediction and regression estimation. However, due to different task drivers, valid features for classification and regression may locate at distinct areas in the training phase. This problem is called feature misalignment. To solve it, we propose a new feature alignment method based on anchor-free object detector. Firstly, a global receptive field adaptor (G-RFA) is designed by incorporating the feature pyramid networks (FPN) with the global attention mechanism, and forward features are further fine-tuned with a deformable-subnet (De-Subnet) to remove the influence of redundant contextual information. Then, a new feature filter strategy with a misalignment score is proposed to guide the network to focus on sampling points with aligned features. In addition, we establish mutually independent multi-layer quality distributions to model the priori information of an object on different FPN levels. Equipped with our method, the classification and regression features are aligned, and the generated foreground weight map converges to the centers of classification and regression heatmaps. Experimental results show that without bells and whistles, our method achieves 49.3% AP on MS COCO test-dev under the default 2x training schedule, outperforming related methods. Besides, experiments on PASCAL VOC demonstrate the generalization ability of our method. Code is available at https://github.com/GFENGG/featurealign.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call