Universal Relocalizer for Weakly Supervised Referring Expression Grounding

Panpan Zhang,Meng Liu,Zan Gao,Da Cao,Xuemeng Song,Liqiang Nie

doi:10.1145/3656045

Abstract

This article introduces the Universal Relocalizer, a novel approach designed for weakly supervised referring expression grounding. Our method strives to pinpoint a target proposal that corresponds to a specific query, eliminating the need for region-level annotations during training. To bolster the localization precision and enrich the semantic understanding of the target proposal, we devise three key modules: the category module, the color module, and the spatial relationship module. The category and color modules assign respective category and color labels to region proposals, enabling the computation of category and color scores. Simultaneously, the spatial relationship module integrates spatial cues, yielding a spatial score for each proposal to enhance localization accuracy further. By adeptly amalgamating the category, color, and spatial scores, we derive a refined grounding score for every proposal. Comprehensive evaluations on the RefCOCO, RefCOCO+, and RefCOCOg datasets manifest the prowess of the Universal Relocalizer, showcasing its formidable performance across the board.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Universal Relocalizer for Weakly Supervised Referring Expression Grounding

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Multimedia Computing, Communications, and Applications

Lead the way for us

Similar Papers

Non-Binary Turbo Coded Spatial Modulation
Shigeaki Hashimoto ... Koji Ishii
-
Shigeaki Hashimoto, et. al.Shigeaki Hashimoto ... Koji Ishii
01 Sep 2013
01 Sep 2013

Using phenotypic and genotypic big data to investigate the effect of muscle fiber characteristics on meat quality and eating quality traits in pigs
Liping Cai ... Lusheng Huang
Meat Science | VOL. 198
Liping Cai, et. al.Liping Cai ... Lusheng Huang
21 Jan 2023
Meat Science | VOL. 198

Luminance dependency of perceived color shift after color contrast adaptation caused by higher-order color channels.
Takehiro Nagai ... Yasuki Yamauchi
Journal of vision | VOL. 22
Takehiro Nagai, et. al.Takehiro Nagai ... Yasuki Yamauchi
28 Jun 2022
Journal of vision | VOL. 22

Performance analysis of cooperative spatial modulation with multiple-antennas at relay
Gokhan Altin ... Ertugrul Basar
-
Gokhan Altin, et. al.Gokhan Altin ... Ertugrul Basar
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Universal Relocalizer for Weakly Supervised Referring Expression Grounding

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Multimedia Computing, Communications, and Applications