MKPM: Multi keyword-pair matching for natural language sentences

Ting Sun,Jun Feng,Richard Sutcliffe,Xia Sun,Yi Gao,Yao Deng,Xin Lu

doi:10.1007/s10489-021-02306-5

Ting Sun, Jun Feng + Show 5 more

Open Access

https://doi.org/10.1007/s10489-021-02306-5

Copy DOI

Journal: Applied Intelligence	Publication Date: May 31, 2021
Citations: 8	License type: open-access

Affiliation: Northwest University, University of Essex

Abstract

Sentence matching is widely used in various natural language tasks, such as natural language inference, paraphrase identification and question answering. For these tasks, we need to understand the logical and semantic relationship between two sentences. Most current methods use all information within a sentence to build a model and hence determine its relationship to another sentence. However, the information contained in some sentences may cause redundancy or introduce noise, impeding the performance of the model. Therefore, we propose a sentence matching method based on multi keyword-pair matching (MKPM), which uses keyword pairs in two sentences to represent the semantic relationship between them, avoiding the interference of redundancy and noise. Specifically, we first propose a sentence-pair-based attention mechanism sp-attention to select the most important word pair from the two sentences as a keyword pair, and then propose a Bi-task architecture to model the semantic information of these keyword pairs. The Bi-task architecture is as follows: 1. In order to understand the semantic relationship at the word level between two sentences, we design a word-pair task (WP-Task), which uses these keyword pairs to complete sentence matching independently. 2. We design a sentence-pair task (SP-Task) to understand the sentence level semantic relationship between the two sentences by sentence denoising. Through the integration of the two tasks, our model can understand sentences more accurately from the two granularities of word and sentence. Experimental results show that our model can achieve state-of-the-art performance in several tasks. Our source code is publicly available1.

Highlights

Extended author information available on the last page of the article.given response can answer a question correctly [4, 5]
We first evaluate the performance of the model on the two Chinese paraphrase identification datasets Large-scale Chinese Question Matching Corpus (LCQMC) and Bank Question (BQ); in order to prove that our method is suitable for different languages, we apply the model to the English dataset Quora Question Pair (QQP)
In order to demonstrate that our method is suitable for different sentence matching tasks, we evaluate it on the natural language inference dataset Stanford Natural Language Inference (SNLI)

Summary

Introduction

Given response can answer a question correctly [4, 5] In these tasks, it is not easy to correctly predict the relationship between two sentences [6], due to the diversity of language expression and the complexity of sentence semantics. Because a sentenceencoding based method cannot capture the interactive features between two sentences, people usually take advantage of the joint-feature based method, which utilizes the interactive features or attention information across two sentences to encode them, resulting in a significant improvement in model performance. It has been found that using a deeper model structure can improve the performance of the model This kind of structure can extract the deeper semantic features and dependency relationships of sentences [11]

Methods

Results

Discussion

Conclusion