Abstract

The problem of measuring sentence similarity is an essential issue in the natural language processing area. It is necessary to measure the similarity between sentences accurately. Sentence similarity measuring is the task of finding semantic symmetry between two sentences, regardless of word order and context of the words. There are many approaches to measuring sentence similarity. Deep learning methodology shows a state-of-the-art performance in many natural language processing fields and is used a lot in sentence similarity measurement methods. However, in the natural language processing field, considering the structure of the sentence or the word structure that makes up the sentence is also important. In this study, we propose a methodology combined with both deep learning methodology and a method considering lexical relationships. Our evaluation metric is the Pearson correlation coefficient and Spearman correlation coefficient. As a result, the proposed method outperforms the current approaches on a KorSTS standard benchmark Korean dataset. Moreover, it performs a maximum of a 65% increase than only using deep learning methodology. Experiments show that our proposed method generally results in better performance than those with only a deep learning model.

Highlights

  • With the advent of the big data era, many people are receiving services from it

  • Our research aims to improve the performance of methods for measuring the similarity between the Korean language sentences by combining a deep learning methodology and a methodology that considers lexical relationships

  • This study measured the similarity between the Korean language sentences by combining a deep learning methodology and a method that considers lexical relationships

Read more

Summary

Introduction

With the advent of the big data era, many people are receiving services from it. For example, you can be provided with decision-making strategies for social and economic development, or you can automatically find the information you want on the web [1,2]. Many new convenient services are emerging, such as product recommendations during shopping, human movement analysis, and health checks using data [3,4]. These days, natural language data such as social networks and news articles are pouring out; natural language processing have received tremendous attention [5]. Measuring the similarity between natural language sentences is important even within natural language processing [6]. Many approaches such as a chatbot system, plagiarism checking system, and automatic classification system all depend on sentence similarity. Measuring the similarity between two sentences is a crucial task

Methods
Findings
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call