Multiple Instance Detection Networks With Adaptive Instance Refinement

Zhihao Wu,Yong Xu,David Zhang,Jie Wen,Jian Yang

doi:10.1109/tmm.2021.3125130

Abstract

Weakly supervised object detection (WSOD) aims to train object detectors by using only image-level annotations. Many recent works on WSOD adopt multiple instance detection networks (MIDN), which usually generate a certain number of proposals and regard proposal classification as a latent model learning within image classification. However, these methods tend to detect salient object, salient object parts and clustered objects due to lack of instance-level annotations during training. Thus a core issue is how to guarantee that the network learn as many objects with precise bounding boxes as possible. In this paper, we address this issue by exploiting the potential of proposal scores during training. We propose an adaptive instance refinement (AIR) framework with three novel designs, which can be integrated with MIDN into a single network. Specifically, adaptive instance mining attempts to discover all positive instances according to the score distribution of proposals and their spatial similarity. Adaptive score modulation dynamically adjusts proposal scores to make the network focus more on instances with different difficulties in different training iterations. Adaptive knowledge refinement distills important information from all previous stages by the weighted average of proposal scores. The experimental results on the PASCAL VOC 2007 and 2012 benchmarks and the MS COCO benchmark demonstrate that AIR significantly improves the performance of the original MIDN and achieves the state-of-the-art results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Multimedia	Publication Date: Jan 1, 2023
Citations: 20	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Multiple Instance Detection Networks With Adaptive Instance Refinement

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia

Lead the way for us

Similar Papers

Efficient Weakly-Supervised Object Detection With Pseudo Annotations
Qingsheng Yuan ... Biao Leng
IEEE Access | VOL. 9
Qingsheng Yuan, et. al.Qingsheng Yuan ... Biao Leng
01 Jan 2020
IEEE Access | VOL. 9

Multi-peak Graph-based Multi-instance Learning for Weakly Supervised Object Detection
Ruyi Ji ... Chen Zhao
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 17
Ruyi Ji, et. al.Ruyi Ji ... Chen Zhao
14 Jun 2021
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 17

Weakly Supervised Region Proposal Network and Object Detection
Peng Tang ... Angtian Wang
-
Peng Tang, et. al.Peng Tang ... Angtian Wang
01 Jan 2018
01 Jan 2018

C-MIDN: Coupled Multiple Instance Detection Network With Segmentation Guidance for Weakly Supervised Object Detection
Gao Yan ... Xiaochun Ye
-
Gao Yan, et. al.Gao Yan ... Xiaochun Ye
01 Oct 2019
01 Oct 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiple Instance Detection Networks With Adaptive Instance Refinement

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia