Abstract

Sentence similarity is widely used in various natural language tasks such as natural language inference, paraphrase identification, and question answering. However, a variety of linguistic expressions and ambiguities of words in sentences make it difficult to measure sentence similarity. Many studies show that using local features or global features of a sentence will produce satisfactory sentence representations that can be utilized to measure sentence similarity. Local features reflect the relationships of adjacent words for each sentence and the sequence information of a sentence are usually expressed by global features. However, local features lack abilities to capture sequence information while a small amount of extracted global features is not enough to produce sentence representations with good qualities. In this paper, we propose A Hybrid Model combining Local and Global features into a Siamese Network (HM-LGSN) for sentence similarity calculation. We first propose a new convolution neural network architecture called group convolution neural network to extract the most representative local features (or word semantic features). Then we combine these new features with pre-trained embeddings of words as input to the Bidirectional Gated Recurrent Units to extract global features of sentences. Finally, we select the global features to form sentence representations and calculate sentence similarity through Manhattan distance. The experimental results on SICK, MSRVID, STS-B datasets show that the accuracy of our proposed model is significantly improved by combining local features and global features.

Highlights

  • Sentence similarity is a challenging research task with applications in many Natural Language Processing (NLP) tasks, such as question answering [1], document summarization [2], and sentence generation [3]

  • We firstly propose a novel convolution neural network architecture called group convolution neural network (G-Convolutional Neural Network (CNN)) to extract local features and get two types of feature maps derived from a large number of convolution filters

  • We present a hybrid model based on a Siamese network for sentence similarity calculation

Read more

Summary

INTRODUCTION

Sentence similarity is a challenging research task with applications in many Natural Language Processing (NLP) tasks, such as question answering [1], document summarization [2], and sentence generation [3]. By generating so-called sentence vectors, similarity can be measured in different ways, such as cosine distance He et al [14] use a Convolutional Neural Network (CNN) to extract features of sentences in different granularity and propose a new feature filtering algorithm to form final sentence representations. We propose a Hybrid Model combining Local and Global features into a Siamese Network (HM-LGSN) for sentence similarity calculation. We use the Bidirectional Gated Recurrent Unit (Bi-GRU) to extract global features of a sentence with the input of new word semantic features In this way, final sentence representations will contain both local and global information of sentences. We present a hybrid model based on a Siamese network for sentence similarity calculation This model integrates local features (through G-CNN) and global features (through Bi-GRU) to produce better sentence representations.

RELATED WORK
SENTENCE ENCODING
SENTENCE SIMILARITY CALCULATING
TRAINING DETAILS
DATASETS
Findings
CONCLUSION AND FUTURE WORK
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call