A novel approach to generate a large scale of supervised data for short text sentiment analysis

Xiao Sun,Jiajin He

doi:10.1007/s11042-018-5748-4

Abstract

As for the complexity of language structure, the semantic structure, and the relative scarcity of labeled data and context information, sentiment analysis has been regarded as a challenging task in Natural Language Processing especially in the field of short-text processing. Deep learning model need a large scale of training data to overcome data sparseness and the over-fitting problem, we propose multi-granularity text-oriented data augmentation technologies to generate large-scale artificial data for training model, which is compared with Generative adversarial network(GAN). In this paper, a novel hybrid neural network model architecture(LSCNN) was proposed with our data augmentation technology, which is can outperforms many single neural network models. The proposed data augmentation method enhances the generalization ability of the proposed model. Experiment results show that the proposed data augmentation method in combination with the neural networks model can achieve astonishing performance without any handcrafted features on sentiment analysis or short text classification. It was validated on a Chinese on-line comment dataset and Chinese news headline corpus, and outperforms many state-of-the-art models. Evidence shows that the proposed data argumentation technology can obtain more accurate distribution representation from data for deep learning, which improves the generalization characteristics of the extracted features. The combination of the data argumentation technology and LSCNN fusion model is well suited to short text sentiment analysis, especially on small scale corpus.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A novel approach to generate a large scale of supervised data for short text sentiment analysis

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Journal: Multimedia Tools and Applications	Publication Date: Feb 12, 2018
Citations: 40

Similar Papers

A multi-granularity data augmentation based fusion neural network model for short text sentiment analysis
Xiao Sun ... Changqin Quan
-
Xiao Sun, et. al.Xiao Sun ... Changqin Quan
01 Oct 2017
01 Oct 2017

A dual deep neural network with phrase structure and attention mechanism for sentiment analysis
Dongning Rao ... Rizwan Patan
Neural Computing and Applications | VOL. 33
Dongning Rao, et. al.Dongning Rao ... Rizwan Patan
11 Jan 2021
Neural Computing and Applications | VOL. 33

Self‐supervised short text classification with heterogeneous graph neural networks
Meng Cao ... Baoming Zhang
Expert Systems | VOL. 40
Meng Cao, et. al.Meng Cao ... Baoming Zhang
03 Mar 2023
Expert Systems | VOL. 40

Keyword-Text Graph Representation for Short Text Classification
Piyawat Chuanakrud ... Nont Kanungsukkasem
-
Piyawat Chuanakrud, et. al.Piyawat Chuanakrud ... Nont Kanungsukkasem
14 Oct 2021
14 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A novel approach to generate a large scale of supervised data for short text sentiment analysis

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications