Abstract

It is challenging to detect emotions on user-generated contents (UGC) because it tends to have sparse emotional semantics, multiple emotions in the same text, and a fast update of emotional expression. Word embedding can extract high-level features in words to enrich their semantics. The convolutional neural networks (CNNs) can make model training more efficient to adapt to fast-changing emotional expressions, but its original pooling operation can not simultaneously filter out multiple emotional features that are beneficial to classification results, which is not suitable for UGC. In this paper, we propose a self-attentive convolutional neural networks (SACNNs) trained on top of pre-trained word vectors for emotion detection on UGC, which reserves different kinds of emotion information, avoids the loss of emotional aspects in the pooling process of CNNs structure, and increases the interpretability of the model by visualizing the extraction process of features. Meanwhile, the convergence speed of the model training is accelerated, and the model is updated in real time to detect emotions with rich and novel expressions. The proposed model combines the CNNs and the self-attention mechanism, the self-attention mechanism can select key emotional features after convolution. We evaluate the proposed model with two datasets from NLPCC 2014 and SemEval 2018 Task 1. The experimental results show that our model obtain significant performance than baselines in multi-label classification on UGC. In addition, the experimental results and the rationality of the self-attention mechanism are analyzed in detail, and the influential convolutional filter windows are visualized based on attention weights.

Highlights

  • Social media such as Twitter, Sina Weibo provide an open platform for people to share their ideas or opinions and generates masses of personalized texts

  • We propose the self-attentive convolutional neural networks(SACNNs) trained on top of pre-trained word vectors to classify multiple emotions on user generated contents (UGC)

  • We propose SACNNs to effectively extract multiple emotional semantic features on UGC and accelerate the convergence of the model

Read more

Summary

INTRODUCTION

Social media such as Twitter, Sina Weibo provide an open platform for people to share their ideas or opinions and generates masses of personalized texts. The dictionary-based method estimates the emotions by hand-crafted features, The associate editor coordinating the review of this manuscript and approving it for publication was Kathiravan Srinivasan Such as sentiment lexicons, which produce a high efficiency on experimental results, but the accuracy depends on the input words, and updating the dictionary is a challenge. Vocabulary, and can extract the emotional semantic features from the given text in a self-disciplined way is needed According to these characteristics, the feature-based method is more suitable for UGC. CNNs extracts local semantic features of text through convolutional operations, filters feature by pooling operation, which is flexible and efficient. The architecture of our model has strong scalability. Our method can intuitively visualize the process of extracting key emotional semantics from convolutional filter windows based on attention weights, which increases the interpretability of the model. Our model has strong robustness and generalization ability, and can be applied to UGC of different domains, which outperforms most existing text classification models in accuracy and efficiency

RELATED WORK
SELF-ATTENTIVE LAYER
CONNECTION LAYER
EXPERIMENTS
EVALUATION
Findings
CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.