Comparison between Calculation Methods for Semantic Text Similarity based on Siamese Networks

Keyang Wang,Yiping Zeng,Fanyu Meng,Feiyu Feiyu,Lili Yang

doi:10.1145/3478905.3478981

Abstract

In the era of information explosion, people are eager to obtain contents that meet their own needs and interests from massive amounts of information. Therefore, how to understand the needs of Internet users correctly and effectively is one of the urgent problems to be solved. In this case, semantic text similarity task is useful in many application scenarios. To measure semantic text similarity based on text matching model, several Siamese networks are constructed in this paper. Specifically, we firstly use the Stsbenchmark dataset, regarding the GloVe, BERT and DistilBERT as initial models, and add deep neural networks to train and fine-tune, fully utilizing the advantages of the existing models. Next, we test several similarity calculation methods to quantify the semantic similarity of sentence pairs. Moreover, the Pearson and Spearman correlation coefficients are used as evaluation indicators to compare the sentence embedding effects of different models. Finally, experiment result shows the Siamese network based on BERT model has the optimal effect among all, with the highest accuracy rate up to 84.5%. While among several similarity calculation methods, the Cosine Similarity usually obtain the best accuracy rate. In the future, this model can be appropriately used in semantic text similarity tasks, through matching texts between users’ needs and knowledge base. In this way, we can improve machines' language understanding ability as well as meeting the diverse needs of users.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparison between Calculation Methods for Semantic Text Similarity based on Siamese Networks

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A BERT-GRU Model for Measuring the Similarity of Arabic Text
Rakia Saidi ... Didier Schwab
JUCS - Journal of Universal Computer Science | VOL. 30
Rakia Saidi, et. al.Rakia Saidi ... Didier Schwab
28 Jun 2024
JUCS - Journal of Universal Computer Science | VOL. 30

Подходы к оценке семантического сходства текстов в многоязычном пространстве
Aida Hakimova ... Evgenii Sokolov
-
Aida Hakimova, et. al.Aida Hakimova ... Evgenii Sokolov
23 Nov 2020
23 Nov 2020

Semantic textual similarity between sentences using bilingual word semantics
Md Shajalal ... Masaki Aono
Progress in Artificial Intelligence | VOL. 8
Md Shajalal, et. al.Md Shajalal ... Masaki Aono
09 Mar 2019
Progress in Artificial Intelligence | VOL. 8

The 2019 n2c2/OHNLP Track on Clinical Semantic Textual Similarity: Overview.
Yanshan Wang ... Sunyang Fu
JMIR medical informatics | VOL. 8
Yanshan Wang, et. al.Yanshan Wang ... Sunyang Fu
27 Nov 2020
The 2019 n2c2/OHNLP Track on Clinical Semantic Textual Similarity: Overview.
Yanshan Wang ... Sunyang Fu

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison between Calculation Methods for Semantic Text Similarity based on Siamese Networks

Abstract

Talk to us

Similar Papers