Action-Aware Embedding Enhancement for Image-Text Retrieval

Jiangtong Li,Liqing Zhang,Li Niu

doi:10.1609/aaai.v36i2.20020

Abstract

Image-text retrieval plays a central role in bridging vision and language, which aims to reduce the semantic discrepancy between images and texts. Most of existing works rely on refined words and objects representation through the data-oriented method to capture the word-object cooccurrence. Such approaches are prone to ignore the asymmetric action relation between images and texts, that is, the text has explicit action representation (i.e., verb phrase) while the image only contains implicit action information. In this paper, we propose Action-aware Memory-Enhanced embedding (AME) method for image-text retrieval, which aims to emphasize the action information when mapping the images and texts into a shared embedding space. Specifically, we integrate action prediction along with an action-aware memory bank to enrich the image and text features with action-similar text features. The effectiveness of our proposed AME method is verified by comprehensive experimental results on two benchmark datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Action-Aware Embedding Enhancement for Image-Text Retrieval

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 28, 2022
Citations: 13

Similar Papers

Memorize, Associate and Match: Embedding Enhancement via Fine-Grained Alignment for Image-Text Retrieval.
Jiangtong Li ... Liqing Zhang
IEEE Transactions on Image Processing | VOL. 30
Jiangtong Li, et. al.Jiangtong Li ... Liqing Zhang
01 Jan 2020
IEEE Transactions on Image Processing | VOL. 30

Global-aware Fragment Representation Aggregation Network for image-text retrieval
Di Wang ... Lihuo He
Pattern Recognition | VOL. -
Di Wang, et. al.Di Wang ... Lihuo He
01 Oct 2024
Pattern Recognition | VOL. -

High-Accuracy Tomato Leaf Disease Image-Text Retrieval Method Utilizing LAFANet.
Jiaxin Xu ... Guoxiong Zhou
Plants | VOL. 13
Jiaxin Xu, et. al.Jiaxin Xu ... Guoxiong Zhou
23 Apr 2024
Plants | VOL. 13

Review of Recent Deep Learning Based Methods for Image-Text Retrieval
Jianan Chen ... Cong Bai
-
Jianan Chen, et. al.Jianan Chen ... Cong Bai
17 Feb 2020
17 Feb 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Action-Aware Embedding Enhancement for Image-Text Retrieval

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence