Globally Guided Confidence Enhancement Network for Image-Text Matching

Xin Dai,Mairidan Wushouer,Gulanbaier Tuerhong

doi:10.3390/app13095658

Abstract

Image-text matching is a crucial aspect of multi-modal intelligence. The main challenge in this area is accurately measuring the relevance between the image and text, using evidence obtained through matching. Previous studies either concentrated on obtaining a well-represented global feature to measure similarity directly or on investigating complex matching patterns at a local level before aggregating them, with little attention paid to combining them. We propose a Globally Guided Confidence Enhancement Network that combines both approaches by obtaining a good global representation to guide fine-grained local interactions. In this process, content that better matches the text from a global perspective is enhanced and represented with confidence scores. Extensive experiments demonstrate that the approach we have employed achieves superior performance on Flickr30K and MSCOCO datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: May 4, 2023
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Globally Guided Confidence Enhancement Network for Image-Text Matching

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

End-to-end training image-text matching network
Depeng Wang ... Yibo Sun
-
Depeng Wang, et. al.Depeng Wang ... Yibo Sun
01 Jul 2022
01 Jul 2022

Global-local fusion based on adversarial sample generation for image-text matching
Shichen Huang ... Shuai Liu
Information Fusion | VOL. 103
Shichen Huang, et. al.Shichen Huang ... Shuai Liu
20 Oct 2023
Information Fusion | VOL. 103

Fusion layer attention for image-text matching
Depeng Wang ... Anyu Du
Neurocomputing | VOL. 442
Depeng Wang, et. al.Depeng Wang ... Anyu Du
23 Feb 2021
Neurocomputing | VOL. 442

Hybrid Joint Embedding with Intra-Modality Loss for Image-Text Matching
Doaa B Ebaid ... Adel A El-Zoghabi
-
Doaa B Ebaid, et. al.Doaa B Ebaid ... Adel A El-Zoghabi
26 Nov 2022
26 Nov 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Globally Guided Confidence Enhancement Network for Image-Text Matching

Abstract

Talk to us

Similar Papers

More From: Applied Sciences