Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding

Xuejing Liu,Dechao Meng,Qingming Huang,Liang Li,Zheng-Jun Zha,Shuhui Wang

doi:10.1109/iccv.2019.00270

Abstract

Weakly supervised referring expression grounding aims at localizing the referential object in an image according to the linguistic query, where the mapping between the referential object and query is unknown in the training stage. To address this problem, we propose a novel end-to-end adaptive reconstruction network (ARN). It builds the correspondence between image region proposal and query in an adaptive manner: adaptive grounding and collaborative reconstruction. Specifically, we first extract the subject, location and context features to represent the proposals and the query respectively. Then, we design the adaptive grounding module to compute the matching score between each proposal and query by a hierarchical attention model. Finally, based on attention score and proposal features, we reconstruct the input query with a collaborative loss of language reconstruction loss, adaptive reconstruction loss, and attribute classification loss. This adaptive mechanism helps our model to alleviate the variance of different referring expressions. Experiments on four large-scale datasets show ARN outperforms existing state-of-the-art methods by a large margin. Qualitative results demonstrate that the proposed ARN can better handle the situation where multiple objects of a particular category situated together.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Entity-Enhanced Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding.
Xuejing Liu ... Liang Li
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Xuejing Liu, et. al.Xuejing Liu ... Liang Li
01 Jan 2021
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

Adaptive modulation attention-based face super-resolution reconstruction method
Zhenglei Xie ... Fengsui Wang
-
Zhenglei Xie, et. al.Zhenglei Xie ... Fengsui Wang
09 Dec 2022
09 Dec 2022

Adaptive Threshold-based Sparse Representation Network for Image Compressive Sensing Reconstruction
Yunyi Xuan ... Chunling Yang
-
Yunyi Xuan, et. al.Yunyi Xuan ... Chunling Yang
05 Dec 2021
05 Dec 2021

Dynamic Sparse R-CNN
Qinghang Hong ... Yi Shan
-
Qinghang Hong, et. al.Qinghang Hong ... Yi Shan
01 Jun 2022
01 Jun 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding

Abstract

Talk to us

Similar Papers