Rethinking prediction alignment in one-stage object detection

Junrui Xiao,He Jiang,Zhikai Li,Qingyi Gu

doi:10.1016/j.neucom.2022.09.132

Abstract

Owing to their excellent performance and efficiency, one-stage detectors have been widely used in multimedia tasks, such as temporal action detection, object tracking, and video detection. However, misalignment between classification and regression branches limits the accuracy of the detector. Most existing works add an auxiliary branch or adopt a specific sample assignment strategy to alleviate this problem, but with little effect. In this paper, we attribute this to incomplete branch interactions and propose a comprehensive Predictive Aligned Object Detector (PAOD), which can better correlate two subtasks. Specifically, our proposed PAOD achieves a better trade-off between prediction-interactive and prediction-specific by adopting an Iterative Aggregation Module (IAM) and a Mutual Constraint Module (MCM). We also design an aligned label assignment with an adaptive metric and re-weighting mechanism to further narrow the misalignment between prediction heads. With negligible additional overhead, PAOD achieves 50.4 AP at single-model single-scale testing on the MS-COCO branch, which demonstrates the effectiveness of our proposal. Notably, PAOD consistently outperforms previous sota such as ATSS (47.7 AP), BorderDet (48.0 AP) and GFL (48.2 AP) by a large margin on COCO test-dev dataset, and achieves better performance than various dense detectors on Pascal VOC and CrowdHuman datasets. Code is available at https://github.com/JunruiXiao/PAOD.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Rethinking prediction alignment in one-stage object detection

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Sep 28, 2022
Citations: 8

Similar Papers

Balanced One-Stage Object Detection by Enhancing the Effect of Positive Samples
Zuyi Wang ... Wenjun Zhu
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33
Zuyi Wang, et. al.Zuyi Wang ... Wenjun Zhu
01 Aug 2023
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33

Balanced knowledge distillation for one-stage object detector
Sungwook Lee ... Byung Cheol Song
Neurocomputing | VOL. 500
Sungwook Lee, et. al.Sungwook Lee ... Byung Cheol Song
26 May 2022
Neurocomputing | VOL. 500

An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network
Zhong Qu ... Tu-Ming Yi
Image and Vision Computing | VOL. 125
Zhong Qu, et. al.Zhong Qu ... Tu-Ming Yi
01 Sep 2022
Image and Vision Computing | VOL. 125

M2Det: A Single-Shot Object Detector Based on Multi-Level Feature Pyramid Network
Qijie Zhao ... Ling Cai
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33
Qijie Zhao, et. al.Qijie Zhao ... Ling Cai
17 Jul 2019
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Rethinking prediction alignment in one-stage object detection

Abstract

Talk to us

Similar Papers

More From: Neurocomputing