Abstract

The unavailability of an annotated dataset for a low-resource Ewe language makes it difficult to develop an automated system to appropriately evaluate public opinion on events, news, policies, and regulations. In this study, we collected and preprocessed a low-resourced document-level Ewe sentiment dataset based on social media comments. We used three (3) features learned by word embeddings (Global vectors, word-to-vector, and FastText) rather than hand-crafted features. We further proposed a novel method termed MC2D-CNN+BiLSTM-Attn to detect the exact sentiment feature from the Ewe dataset. Extensive experiments indicate that the proposed method efficiently classifies various sentiments and is superior to benchmark deep learning methods. Results show that in detecting the precise sentiments from raw Ewe textual context, the BiLSTM incorporating Glove outperforms word2vec and FastText embedding with an accuracy of 0.727. Furthermore, Attn+BiLSTM and Multi-channel CNN methods incorporating the word2vec embedding layer perform better than Glove and FastText embedding with an accuracy of 0.848 and 0.896. In contrast, our proposed method with the same word2vec embedding recorded 0.949.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call