The Influence of Loss Function Usage at SIAMESE Network in Measuring Text Similarity

Suprapto ,Joseph A

doi:10.14569/ijacsa.2020.0111290

Abstract

In a text matching similarity task, a model takes two sequence of text as an input and predicts a category or scale value to show their relationship. A developed model is to measure the similarity - one of relationship between those two text. The model is SIAMESE network that implement two copies of same network of CNN, it takes text_1 and text_2 as the inputs respectively for two CNN networks. The output of each CNN network is features vector of the corresponding text input, both outputs are then fed by a loss function to calculate the value of loss (i.e. similarity). This research implemented two types of loss functions, i.e. Triplet loss and Contrastive loss. The usage purpose of these two types of loss functions was to see the influence toward the measurement results of similarity between two text being compared. The metrices used for this comparison are precision, recall, and F1-score. Based on the experimental results done on 1500 pairs of sentences, and varied on the epoch value starting from 10 until 200 with an increment of 10, showed the best result was for epoch value of 180 with precision 0.8004, recall 0.6780, and F1-score 0.6713 for Triplet loss function; and epoch value of 160 with precision 0.6463, recall 0.6440, and F1-score 0.6451 for Contrastive loss function gave the best performance. So that, the Triplet loss function gave better influence than Contrastive loss function in measuring similarity between two given sentences.

Highlights

The very fast growth of information nowdays causes a particular problem, such as an overwhelming of information [21]. It is very likely among those collections of huge of information found some similar ones, so that, they can be grouped into several classes based on their similarity
Text similarity approach will ease people to find relevance information. It has a great support in successness for text mining operations such as, searching and information retrieval (IR), text classification, information extraction (IE), document clustering [8], sentiment analysis [4] [10] [16][3] [13], machine translation, text summarization, and natural language processing (NLP)
Text similarity measurement may be done by comparing text - text matching

Summary

Introduction

The very fast growth of information nowdays causes a particular problem, such as an overwhelming of information [21]. A text similarity measurements is one of text mining approach that capable of coping with the information overwhelming. This process begins with finding similar word for sentece, paragraph, and document [6]. Text similarity approach will ease people to find relevance information It has a great support in successness for text mining operations such as, searching and information retrieval (IR), text classification, information extraction (IE), document clustering [8], sentiment analysis [4] [10] [16][3] [13], machine translation, text summarization, and natural language processing (NLP). In order to make the alignment process fully used, model must take many external syntaxtical features or aligment as additional inputs at alignment layer [5] [7], adopt a complex alignment mechanism [17], or build a big number of post-process layers to analyze alignment results [7]

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2020
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

The Influence of Loss Function Usage at SIAMESE Network in Measuring Text Similarity

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications

Lead the way for us

Similar Papers

CCL-DTI: contributing the contrastive loss in drug–target interaction prediction
Alireza Dehghan ... Sajjad Gharaghani
BMC Bioinformatics | VOL. 25
Alireza Dehghan, et. al.Alireza Dehghan ... Sajjad Gharaghani
30 Jan 2024
BMC Bioinformatics | VOL. 25

Realization of Reliable and Effective Authentication in Intelligent Systems by Using Visual Biometrics Methods
Taras Batiuk ... Dmytro Dosyn
Vìsnik Nacìonalʹnogo unìversitetu "Lʹvìvsʹka polìtehnìka". Serìâ Ìnformacìjnì sistemi ta merežì | VOL. 15
Taras Batiuk, et. al.Taras Batiuk ... Dmytro Dosyn
15 Jul 2024
Vìsnik Nacìonalʹnogo unìversitetu "Lʹvìvsʹka polìtehnìka". Serìâ Ìnformacìjnì sistemi ta merežì | VOL. 15

Fisher Discriminant Triplet and Contrastive Losses for Training Siamese Networks
Benyamin Ghojogh ... Fakhri Karray
-
Benyamin Ghojogh, et. al.Benyamin Ghojogh ... Fakhri Karray
01 Jul 2020
01 Jul 2020

Triplet online instance matching loss for person re-identification
Ye Li ... Zhiguo Wang
Neurocomputing | VOL. 433
Ye Li, et. al.Ye Li ... Zhiguo Wang
17 Dec 2020
Neurocomputing | VOL. 433

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Influence of Loss Function Usage at SIAMESE Network in Measuring Text Similarity

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications