Dependency-based Siamese long short-term memory network for learning sentence representations.

Wenhao Zhu,Baogang Wei,Tengjun Yao,Jianyue Ni,Zhiguo Lu

doi:10.1371/journal.pone.0193919

Abstract

Textual representations play an important role in the field of natural language processing (NLP). The efficiency of NLP tasks, such as text comprehension and information extraction, can be significantly improved with proper textual representations. As neural networks are gradually applied to learn the representation of words and phrases, fairly efficient models of learning short text representations have been developed, such as the continuous bag of words (CBOW) and skip-gram models, and they have been extensively employed in a variety of NLP tasks. Because of the complex structure generated by the longer text lengths, such as sentences, algorithms appropriate for learning short textual representations are not applicable for learning long textual representations. One method of learning long textual representations is the Long Short-Term Memory (LSTM) network, which is suitable for processing sequences. However, the standard LSTM does not adequately address the primary sentence structure (subject, predicate and object), which is an important factor for producing appropriate sentence representations. To resolve this issue, this paper proposes the dependency-based LSTM model (D-LSTM). The D-LSTM divides a sentence representation into two parts: a basic component and a supporting component. The D-LSTM uses a pre-trained dependency parser to obtain the primary sentence information and generate supporting components, and it also uses a standard LSTM model to generate the basic sentence components. A weight factor that can adjust the ratio of the basic and supporting components in a sentence is introduced to generate the sentence representation. Compared with the representation learned by the standard LSTM, the sentence representation learned by the D-LSTM contains a greater amount of useful information. The experimental results show that the D-LSTM is superior to the standard LSTM for sentences involving compositional knowledge (SICK) data.

Highlights

Learning textual representations is a vital part of natural language processing (NLP) and important for subsequent NLP tasks
This paper proposes the D-Long Short-Term Memory (LSTM) model, which can capture richer information about a sentence than the standard LSTM model and learn an efficient sentence representation
We noticed that dependency-based LSTM model (D-LSTM) (0.5) has a slightly worse mean squared error (MSE) than the top 1 SemEval 2014 submission

Summary

Introduction

Learning textual representations is a vital part of natural language processing (NLP) and important for subsequent NLP tasks. The study of representations of phrases and sentences has attracted the attention of many researchers, who have achieved a degree of success [1]. Researchers hope to directly learn sentence representation via the sum or average based on the word representation, and they have achieved satisfactory results for certain simple NLP tasks [4]. Because of the variable length and complex structure of sentences, these simple algorithms cannot handle complex tasks (such as evaluating the similarity between two sentences). To resolve this problem, Kiros, Tai and Le have proposed methods of learning fixed-length sentence representations [5,6,7]

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS ONE	Publication Date: Mar 7, 2018
Citations: 32	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Dependency-based Siamese long short-term memory network for learning sentence representations.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS ONE

Lead the way for us

Similar Papers

A Novel Temporal Feature Selection Based LSTM Model for Electrical Short-Term Load Forecasting
Khalid Ijaz ... Jameel Ahmad
IEEE Access | VOL. 10
Khalid Ijaz, et. al.Khalid Ijaz ... Jameel Ahmad
01 Jan 2021
IEEE Access | VOL. 10

Groundwater level modeling framework by combining the wavelet transform with a long short-term memory data-driven model
Chengcheng Wu ... Longcang Shu
Science of the Total Environment | VOL. 783
Chengcheng Wu, et. al.Chengcheng Wu ... Longcang Shu
08 Apr 2021
Science of the Total Environment | VOL. 783

Hyperparameter Tuning of Long Short-Term Memory Model for Clickbait Classification in News Headlines
Grace Yudha Satriawan ... Budi Prasetiyo
Recursive Journal of Informatics | VOL. 2
Grace Yudha Satriawan, et. al.Grace Yudha Satriawan ... Budi Prasetiyo
31 Mar 2024
Recursive Journal of Informatics | VOL. 2

Short-Term Prediction in Vessel Heave Motion Based on Improved LSTM Model
Gang Tang ... Shaoyang Men
IEEE Access | VOL. 9
Gang Tang, et. al.Gang Tang ... Shaoyang Men
01 Jan 2020
IEEE Access | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dependency-based Siamese long short-term memory network for learning sentence representations.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS ONE