Abstract

Modern CNN-based object detectors assign anchors to ground-truth objects under the constraint of object-anchor Intersection-over-Union (IoU). In this study, we propose a learning-to-match (LTM) method that breaks the IoU restriction and allows objects to match anchors in a flexible manner. LTM replaces hand-crafted anchor assignment with "free" anchor matching by formulating detector training in the Maximum Likelihood Estimation (MLE) framework. During training, LTM is implemented by converting the detection likelihood into plug-and-play anchor matching loss functions. Minimizing these matching losses drives the detector to learn and select the features that best explain each object class with respect to both classification and localization. LTM is further extended from anchor-based detectors to anchor-free detectors, validating the general applicability of the learnable object-feature matching mechanism for visual object detection. Experiments on the MS COCO dataset demonstrate that LTM detectors consistently outperform their counterpart detectors by significant margins. Last but not least, LTM incurs negligible computational cost in both the training and inference phases, as it introduces no additional network architecture or parameters. Code has been made publicly available.
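To make the likelihood-to-loss conversion concrete, the following is a minimal PyTorch-style sketch, not the paper's exact formulation: the function name `matching_loss`, the candidate-bag construction, the soft-max weighting, and the temperature value are all illustrative assumptions.

```python
import torch

def matching_loss(cls_prob, loc_prob, bag_indices):
    """Hypothetical sketch of a likelihood-based anchor-matching loss.

    cls_prob:    (num_anchors,) classification confidence of each anchor
                 for the object's class, in [0, 1].
    loc_prob:    (num_anchors,) localization confidence of each anchor,
                 e.g. derived from the IoU between the regressed box and
                 the ground-truth box.
    bag_indices: list of LongTensors, one candidate-anchor bag per object
                 (e.g. the top-k anchors ranked by IoU with that object).
    """
    losses = []
    for bag in bag_indices:
        # Joint likelihood that an anchor in the bag detects the object,
        # accounting for both classification and localization.
        joint = cls_prob[bag] * loc_prob[bag]
        # A soft maximum over the bag lets gradients "select" the anchor
        # that best explains the object; a hard max is the limiting case.
        weights = torch.softmax(joint / 0.1, dim=0)
        likelihood = (weights * joint).sum().clamp(min=1e-6)
        # Maximizing the likelihood is minimizing its negative log.
        losses.append(-torch.log(likelihood))
    return torch.stack(losses).mean()

# Toy usage with random confidences and three objects.
cls_prob = torch.rand(100)
loc_prob = torch.rand(100)
bags = [torch.randint(0, 100, (50,)) for _ in range(3)]
print(matching_loss(cls_prob, loc_prob, bags))
```

Because the loss is a function of existing network outputs only, a sketch like this can be dropped into a standard detector training loop without extra layers or parameters, which is consistent with the negligible-overhead claim above.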
