Although sentiment analysis on English datasets has made significant progress, relatively few studies address sentiment analysis of Chinese car review texts. In addition, existing Chinese text sentiment analysis methods cannot simultaneously extract contextual features and local semantic information from review texts. To address these issues, a hybrid deep learning model, BiLSTM-CNN, was designed and validated on Chinese car review texts. We trained word vectors with the CBOW model on the Chinese Wikipedia corpus combined with other open-source car-related corpora. The resulting word vectors cover car-specific vocabulary, which improves sentiment classification accuracy. Experimental results show that the deep learning models (CNN, LSTM, BiLSTM) score much better on the performance indices than the KNN, SVM, Naïve Bayes, and RF models, which is attributed to deep learning's powerful feature extraction and nonlinear fitting abilities. Furthermore, the proposed hybrid BiLSTM-CNN model outperformed the other deep learning models it was compared with (CNN, LSTM, BiLSTM, and LSTM-CNN). The results may serve as a reference for consumers buying a car and for car companies optimizing their products.
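To make the data flow of such a hybrid concrete, the following is a minimal pure-Python sketch of a forward pass, not the authors' implementation. It assumes that the BiLSTM output feeds a one-dimensional convolution with max pooling, which the model name suggests but the abstract does not spell out. All dimensions, the random untrained weights (standing in for CBOW-trained word vectors and learned parameters), the omission of bias terms, and every function name are illustrative assumptions.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def rand_matrix(rows, cols, rng):
    # Small random weights; a real model would learn these.
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]


class LSTMCell:
    """One LSTM direction with untrained weights and no bias terms (assumption)."""

    def __init__(self, input_dim, hidden_dim, rng):
        self.hidden_dim = hidden_dim
        # One weight matrix per gate, applied to the concatenation [x_t ; h_{t-1}].
        self.w = {g: rand_matrix(hidden_dim, input_dim + hidden_dim, rng)
                  for g in "ifoc"}

    def run(self, sequence):
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        outputs = []
        for x in sequence:
            z = x + h  # list concatenation of input and previous hidden state
            i = [sigmoid(a) for a in matvec(self.w["i"], z)]    # input gate
            f = [sigmoid(a) for a in matvec(self.w["f"], z)]    # forget gate
            o = [sigmoid(a) for a in matvec(self.w["o"], z)]    # output gate
            g = [math.tanh(a) for a in matvec(self.w["c"], z)]  # candidate state
            c = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
            h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
            outputs.append(h)
        return outputs


def bilstm(sequence, fwd, bwd):
    # Contextual features: read the sequence in both directions and
    # concatenate the two hidden states at every position.
    left = fwd.run(sequence)
    right = bwd.run(sequence[::-1])[::-1]
    return [l + r for l, r in zip(left, right)]


def conv1d_maxpool(features, filters, width):
    # Local features: slide each filter over windows of `width` time steps,
    # apply ReLU, and keep the maximum activation per filter.
    pooled = []
    for filt in filters:
        best = 0.0  # valid lower bound because ReLU outputs are non-negative
        for start in range(len(features) - width + 1):
            window = [v for step in features[start:start + width] for v in step]
            best = max(best, max(0.0, sum(w * v for w, v in zip(filt, window))))
        pooled.append(best)
    return pooled


def predict(token_ids, embeddings, fwd, bwd, filters, width, out_weights):
    sequence = [embeddings[t] for t in token_ids]    # word-vector lookup
    context = bilstm(sequence, fwd, bwd)             # contextual features
    local = conv1d_maxpool(context, filters, width)  # local features
    return sigmoid(sum(w * v for w, v in zip(out_weights, local)))


# Hypothetical sizes chosen only to keep the example small.
rng = random.Random(0)
VOCAB, EMB, HID, N_FILTERS, WIDTH = 20, 8, 6, 4, 3
embeddings = rand_matrix(VOCAB, EMB, rng)  # stand-in for CBOW word vectors
fwd = LSTMCell(EMB, HID, rng)
bwd = LSTMCell(EMB, HID, rng)
filters = rand_matrix(N_FILTERS, WIDTH * 2 * HID, rng)
out_weights = [rng.uniform(-0.1, 0.1) for _ in range(N_FILTERS)]

tokens = [3, 7, 1, 12, 5]  # made-up token ids for one segmented review
score = predict(tokens, embeddings, fwd, bwd, filters, WIDTH, out_weights)
print(score)  # a value in (0, 1); meaningless until the weights are trained
```

In this arrangement the recurrent layer supplies each position with context from both directions, and the convolution then picks out short local patterns from those context-aware vectors, which is one way to obtain both kinds of features in a single model.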