Abstract
Sentiment classification is a fundamental task in many natural language processing applications. Neural networks have achieved great success on sentiment classification in recent years, since recurrent neural networks and long short-term memory (LSTM) networks can handle sequences of different lengths and capture contextual semantic information. However, the effectiveness of these methods is limited when extracting contextual information from relatively long texts. Therefore, in our model, we apply bidirectional gated recurrent units (GRUs) to capture as much contextual information as possible when learning word representations, which may effectively reduce noise compared with other methods. We also propose a novel loss function, drop loss (DL), which makes the model focus on hard examples, i.e., examples that are easily misclassified, in order to improve the accuracy of the model. We experiment on four commonly used datasets, and the results show that the proposed method performs well on all four datasets while needing fewer parameters than recent benchmarks such as CoVe, ULMFiT, embeddings from language models (ELMo), and bidirectional encoder representations from transformers (BERT). Furthermore, we demonstrate that the classification performance of existing shallow network models can be significantly improved by using DL; in particular, the accuracy of the CNN+LSTM model improves by 9% on the IMDB-10 dataset.
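The abstract describes the architecture and the drop loss only verbally, so the PyTorch snippet below is a minimal illustrative sketch rather than the authors' implementation: a bidirectional-GRU classifier, plus a hypothetical drop-loss variant that simply discards the easiest examples in each mini-batch so that gradients come from the hard, misclassification-prone ones. The pooling strategy, drop ratio, and hyperparameters are all assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BiGRUClassifier(nn.Module):
    """Minimal bidirectional-GRU sentence classifier (illustrative sketch)."""
    def __init__(self, vocab_size, embed_dim=300, hidden_dim=128, num_classes=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.bigru = nn.GRU(embed_dim, hidden_dim,
                            batch_first=True, bidirectional=True)
        self.fc = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, token_ids):
        embedded = self.embedding(token_ids)      # (batch, seq_len, embed_dim)
        outputs, _ = self.bigru(embedded)         # (batch, seq_len, 2*hidden_dim)
        pooled = outputs.mean(dim=1)              # assumed mean pooling over time
        return self.fc(pooled)                    # (batch, num_classes)

def drop_loss(logits, targets, drop_ratio=0.5):
    """Hypothetical drop loss: keep only the hardest examples in the batch.

    The paper's exact formulation is not given here; this sketch drops the
    easiest `drop_ratio` fraction of examples (lowest per-example
    cross-entropy) and averages the loss over the remaining hard examples.
    """
    per_example = F.cross_entropy(logits, targets, reduction="none")
    keep = max(1, int((1.0 - drop_ratio) * per_example.numel()))
    hardest, _ = torch.topk(per_example, keep)    # largest losses = hard examples
    return hardest.mean()
```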
Highlights
Sentiment classification is one of the most widely used natural language processing (NLP) techniques and has been applied to many areas, such as e-commerce websites, stock forecasting [1], and political orientation analysis [2]–[4].
We evaluate our model on the following datasets: the Stanford massive open online courses (MOOCs) posts dataset, IMDB, and the Sentiment Treebank.
The results show that our approach suffers less from the data sparsity problem and captures more contextual feature information than traditional methods that use BoW features.
Summary
Sentiment classification is one of the most widely used natural language processing (NLP) techniques and has been applied to many areas, such as e-commerce websites, stock forecasting [1], and political orientation analysis [2]–[4]. In the sentiment classification task, feature-based representations play an important role and are often based on the bag-of-words (BoW) model [5], where bigrams or larger n-grams are designed to represent features. A BoW model is used to represent documents by Pang et al. [6] and Wang and Manning [7], who both build SVM classifiers for text classification. Although SVM is an extremely strong performer, the problem of data sparsity when using BoW features heavily affects classification accuracy [8]. Word embedding [9] has brought new inspiration to many NLP tasks [10] for solving the data sparsity problem, because it can represent each word as a low-dimensional, continuous, and real-valued vector [11]. Rao et al. [21], Tang et al. [11], and Xu et al. [22] utilized word embeddings to represent words before using neural networks to learn word representations.
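As a concrete reference point for the BoW-plus-SVM baselines described above, the following scikit-learn sketch shows the general recipe on toy data; it is not the setup of the cited works, and the pipeline choices (uni- and bi-gram counts, a linear SVM) are assumptions made for illustration.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Toy corpus; real experiments would use full IMDB / Sentiment Treebank documents.
train_texts = ["a gripping and heartfelt film", "dull plot and wooden acting"]
train_labels = [1, 0]  # 1 = positive, 0 = negative

# Bag-of-words n-gram counts feeding a linear SVM. With a large vocabulary,
# each document activates only a tiny fraction of the features, which is the
# sparsity issue mentioned in the summary.
model = make_pipeline(CountVectorizer(ngram_range=(1, 2)), LinearSVC())
model.fit(train_texts, train_labels)
print(model.predict(["heartfelt acting"]))
```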