Abstract

Previous research on sentiment classification by machine learning algorithms has shown that they usually work well with large-scale dataset. However, most open datasets for Chinese sentiment classification are quite small. In this paper we build a large-scale annotated Chinese sentiment dataset by filtering a vast amount of human-computer conversations. We conduct thorough experiments by using Convolutional Neural Networks (CNNs) and other classical machine learning methods. The experimental evaluation is made on a human-annotated dataset and COAE2014 task. The extensive experiments demonstrate that Chinese sentiment classification task can benefit from our dataset and CNNs model can achieve better performance than classical machine learning approaches.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call