Using Word Mover’s Distance with Spatial Constraints for Measuring Similarity Between Mongolian Word Images

Hongxi Wei, Hui Zhang, Guanglai Gao, Xiangdong Su

doi:10.1007/978-3-319-70093-9_20

Abstract

In the bag-of-visual-words framework, visual words are treated as independent of each other, so spatial relations are discarded and the visual words carry no semantic information. To capture semantic information, a deep learning procedure similar to word embedding is used to map visual words to embedding vectors in a semantic space. Word mover’s distance (WMD) is then used to measure the similarity between two word images: it calculates the minimum traveling distance from the visual embeddings of one word image to those of the other. To add spatial constraints, each word image is first partitioned into several equal-sized sub-regions along rows and columns. A WMD is then computed separately for each pair of corresponding sub-regions of the two word images, and the similarity between the two word images is the sum of these WMDs. Experimental results show that the proposed method outperforms various baseline and state-of-the-art methods, including spatial pyramid matching, latent Dirichlet allocation, average visual word embeddings, and the original word mover’s distance.
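The procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes each word image is given as a list of keypoints `(x, y, visual_word_id)` with a known width and height, and that `embeddings` holds one vector per visual word. The function names, the Euclidean ground distance, the default 2 x 2 grid, and the `empty_penalty` used when only one of two corresponding sub-regions contains visual words are all hypothetical choices, since the abstract does not specify them. The WMD itself is solved as a transport linear program with `scipy.optimize.linprog`.

```python
import numpy as np
from scipy.optimize import linprog


def wmd(weights_a, weights_b, cost):
    """Minimum cost of moving mass weights_a onto weights_b (both sum to 1)."""
    n, m = cost.shape
    # Flow variables T[i, j] are flattened row-major into a vector of length n*m.
    a_eq = np.zeros((n + m, n * m))
    for i in range(n):
        a_eq[i, i * m:(i + 1) * m] = 1.0   # row i of T sums to weights_a[i]
    for j in range(m):
        a_eq[n + j, j::m] = 1.0            # column j of T sums to weights_b[j]
    b_eq = np.concatenate([weights_a, weights_b])
    res = linprog(cost.ravel(), A_eq=a_eq, b_eq=b_eq,
                  bounds=(0, None), method="highs")
    return res.fun


def region_histograms(points, width, height, rows, cols, vocab_size):
    """Count visual words per equal-sized grid cell of the word image."""
    hists = np.zeros((rows, cols, vocab_size))
    for x, y, word in points:
        r = min(int(y * rows / height), rows - 1)
        c = min(int(x * cols / width), cols - 1)
        hists[r, c, word] += 1
    return hists


def spatial_wmd(points_a, size_a, points_b, size_b, embeddings,
                rows=2, cols=2, empty_penalty=1.0):
    """Sum of per-cell WMDs between two word images (smaller = more similar)."""
    vocab_size = embeddings.shape[0]
    ha = region_histograms(points_a, *size_a, rows, cols, vocab_size)
    hb = region_histograms(points_b, *size_b, rows, cols, vocab_size)
    total = 0.0
    for r in range(rows):
        for c in range(cols):
            a, b = ha[r, c], hb[r, c]
            ia, ib = np.flatnonzero(a), np.flatnonzero(b)
            if len(ia) == 0 and len(ib) == 0:
                continue                    # both cells empty: no contribution
            if len(ia) == 0 or len(ib) == 0:
                total += empty_penalty      # assumption: fixed cost, not from the paper
                continue
            # Pairwise Euclidean distances between the embeddings present in each cell.
            diff = embeddings[ia][:, None, :] - embeddings[ib][None, :, :]
            cost = np.linalg.norm(diff, axis=2)
            total += wmd(a[ia] / a[ia].sum(), b[ib] / b[ib].sum(), cost)
    return total


# Toy example: three visual words embedded in 2-D, two 10 x 10 word images.
toy_embeddings = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
image_a = [(1, 1, 0), (8, 1, 1), (1, 8, 2), (8, 8, 0)]
image_b = [(1, 1, 1), (8, 1, 1), (1, 8, 2), (8, 8, 2)]
print(spatial_wmd(image_a, (10, 10), image_b, (10, 10), toy_embeddings))
```

Setting `rows=1, cols=1` reduces the sketch to the original WMD over the whole image, which makes the effect of the spatial partition easy to compare on the same inputs.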

Similar Papers

Word Image Representation Based on Visual Embeddings and Spatial Constraints for Keyword Spotting on Historical Documents
Hongxi Wei ... Guanglai Gao
01 Aug 2018

Representing word image using visual word embeddings and RNN for keyword spotting on historical document images
Hongxi Wei ... Hui Zhang
01 Jul 2017

Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images
Hongxi Wei ... Guanglai Gao
01 Jan 2018

Fast paraphrase extraction in Ancient Greek literature
Marcus Pöckelmann ... Jörg Ritter
it - Information Technology | VOL. 62
06 Mar 2020
