PTF‐SimCM: A Simple Contrastive Model with Polysemous Text Fusion for Visual Similarity Metric

Xinpan Yuan,Chengyuan Zhang,Zhiqi Zhang,Xinxin Mao,Shaojun Xie,Wei Xia

doi:10.1155/2022/2343707

Abstract

Image similarity metric, also known as metric learning (ML) in computer vision, is a significant step in various advanced image tasks. Nevertheless, existing well‐performing approaches for image similarity measurement only focus on the image itself without utilizing the information of other modalities, while pictures always appear with the described text. Furthermore, those methods need human supervision, yet most images are unlabeled in the real world. Considering the above problems comprehensively, we present a novel visual similarity metric model named PTF‐SimCM. It adopts a self‐supervised contrastive structure like SimSiam and incorporates a multimodal fusion module to utilize textual modality correlated to the image. We apply a cross‐modal model for text modality rather than a standard unimodal text encoder to improve late fusion productivity. In addition, the proposed model employs Sentence PIE‐Net to solve the issue caused by polysemous sentences. For simplicity and efficiency, our model learns a specific embedding space where distances directly correspond to the similarity. Experimental results on MSCOCO, Flickr 30k, and Pascal Sentence datasets show that our model overall outperforms all the compared methods in this work, which illustrates that the model can effectively address the issues faced and enhance the performances on unsupervised visual similarity measuring relatively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PTF‐SimCM: A Simple Contrastive Model with Polysemous Text Fusion for Visual Similarity Metric

Abstract

Talk to us

Similar Papers

More From: Complexity

Lead the way for us

Journal: Complexity	Publication Date: Jan 1, 2022
License type: CC BY 4.0

Similar Papers

Learning a hybrid similarity measure for image retrieval
Jun Wu ... Chun-Li Wang
Pattern Recognition | VOL. 46
Jun Wu, et. al.Jun Wu ... Chun-Li Wang
20 Apr 2013
Pattern Recognition | VOL. 46

Visual letter similarity effects during sentence reading: Evidence from the boundary technique
Ana Marcet ... Manuel Perea
Acta Psychologica | VOL. 190
Ana Marcet, et. al.Ana Marcet ... Manuel Perea
14 Aug 2018
Acta Psychologica | VOL. 190

Image Similarity based on a Distributional "Metric" for Multivariate Data
Christos Theoharatos ... Nikolaos A.
-
Christos Theoharatos, et. al.Christos Theoharatos ... Nikolaos A.
01 Jun 2007
01 Jun 2007

A New Similarity Measure with Deformation Detection of Visual Salient Regions for Image Retargeting
Canlin Li ... Fubao Zhu
International Journal of Multimedia and Ubiquitous Engineering | VOL. 9
Canlin Li, et. al.Canlin Li ... Fubao Zhu
31 Jul 2014
International Journal of Multimedia and Ubiquitous Engineering | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PTF‐SimCM: A Simple Contrastive Model with Polysemous Text Fusion for Visual Similarity Metric

Abstract

Talk to us

Similar Papers

More From: Complexity