Abstract

Weakly Supervised Object Localization (WSOL) is a technique for obtaining the object location from attention maps of the classification network, without using bounding box annotations. Existing WSOL approaches lack the modeling of the correlations between different regions of the target object. Hence they can only locate some discriminative attentions, which are small and sparse. Besides, they introduce too many background attentions when mining more object parts. In this paper, we propose a novel Adaptive Attention Augmentor (A3) to adaptively augment the target object attentions on class attention maps. It can supplement object attentions by discovering the semantic correspondence between different regions and dynamically suppress background attentions through the proposed Focal Dice loss. Extensive experiments demonstrate the effectiveness of our approach. On the ILSVRC dataset, A3 achieves a new state-of-the-art localization performance. On the fine-grained datasets including CUB-200–2011 and Cars-196, it also achieves very competitive results.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call