A method for image–text matching based on semantic filtering and adaptive adjustment

Ran Jin,Tengda Hou,Tao Jin,Jie Yuan,Chenjie Du

doi:10.1186/s13640-024-00639-y

Abstract

As image–text matching (a critical task in the field of computer vision) links cross-modal data, it has captured extensive attention. Most of the existing methods intended for matching images and texts explore the local similarity levels between images and sentences to align images with texts. Even though this fine-grained approach has remarkable gains, how to further mine the deep semantics between data pairs and focus on the essential semantics in data remains to be quested. In this work, a new semantic filtering and adaptive approach (FAAR) was proposed to ease the above problem. To be specific, the filtered attention (FA) module selectively focuses on typical alignments with the interference of meaningless comparisons eliminated. Next, the adaptive regulator (AR) further adjusts the attention weights of key segments for filtered regions and words. The superiority of our proposed method was validated by a number of qualitative experiments and analyses on the Flickr30K and MSCOCO data sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A method for image–text matching based on semantic filtering and adaptive adjustment

Abstract

Talk to us

Similar Papers

More From: EURASIP Journal on Image and Video Processing

Lead the way for us

Journal: EURASIP Journal on Image and Video Processing	Publication Date: Aug 29, 2024
License type: cc-by-nc-nd

Similar Papers

Saliency-Guided Attention Network for Image-Sentence Matching
Zhong Ji ... Yanwei Pang
-
Zhong Ji, et. al.Zhong Ji ... Yanwei Pang
01 Oct 2019
01 Oct 2019

End-to-end training image-text matching network
Depeng Wang ... Chen Hong
-
Depeng Wang, et. al.Depeng Wang ... Chen Hong
01 Jul 2022
01 Jul 2022

Image captioning using relevance attention and ITEM encoding
Hongliang Zhang ... Guangming Li
-
Hongliang Zhang, et. al.Hongliang Zhang ... Guangming Li
14 Dec 2021
14 Dec 2021

Optimization of urban and rural ecological spatial planning based on deep learning under the concept of sustainable development
Yilin Lai
Results in Engineering | VOL. 19
Yilin LaiYilin Lai
09 Aug 2023
Results in Engineering | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A method for image–text matching based on semantic filtering and adaptive adjustment

Abstract

Talk to us

Similar Papers

More From: EURASIP Journal on Image and Video Processing