Exploring Amharic Sentiment Analysis from Social Media Texts: Building Annotation Tools and Classification Models

Seid Muhie Yimam,Hizkiel Mitiku Alemayehu,Chris Biemann,Abinew Ayele

doi:10.18653/v1/2020.coling-main.91

Abstract

This paper presents the study of sentiment analysis for Amharic social media texts. As the number of social media users is ever-increasing, social media platforms would like to understand the latent meaning and sentiments of a text to enhance decision-making procedures. However, low-resource languages such as Amharic have received less attention due to several reasons such as lack of well-annotated datasets, unavailability of computing resources, and fewer or no expert researchers in the area. This research addresses three main research questions. We first explore the suitability of existing tools for the sentiment analysis task. Annotation tools are scarce to support large-scale annotation tasks in Amharic. Also, the existing crowdsourcing platforms do not support Amharic text annotation. Hence, we build a social-network-friendly annotation tool called ‘ASAB’ using the Telegram bot. We collect 9.4k tweets, where each tweet is annotated by three Telegram users. Moreover, we explore the suitability of machine learning approaches for Amharic sentiment analysis. The FLAIR deep learning text classifier, based on network embeddings that are computed from a distributional thesaurus, outperforms other supervised classifiers. We further investigate the challenges in building a sentiment analysis system for Amharic and we found that the widespread usage of sarcasm and figurative speech are the main issues in dealing with the problem. To advance the sentiment analysis research in Amharic and other related low-resource languages, we release the dataset, the annotation tool, source code, and models publicly under a permissive.

Highlights

Sentiment analysis is the task of detecting the orientation of someone’s opinion and analyzing the emotions, feelings, and attitudes of a speaker or a writer in a piece of information concerning a certain situation, object, or event (Pandey and Govilkar, 2015)
K-Nearest Neighbor (KNN): KNN works by determining the nearest neighbors to a given query and use those classes to predict the right class of the query (Cunningham and Delany, 2020)
We have followed the suggestions by De Souza Bermejo et al (2019) to categorize sentiment classes into ‘positive’, ‘negative‘, ‘neutral’, and ‘mixed‘

Summary

Introduction

Sentiment analysis is the task of detecting the orientation of someone’s opinion and analyzing the emotions, feelings, and attitudes of a speaker or a writer in a piece of information concerning a certain situation, object, or event (Pandey and Govilkar, 2015). The most widely adopted approach in sentiment analysis to explore opinions is by employing very large datasets that target products and services, political, economical, social, and cultural feelings (Kauffmann et al, 2019; Caetano et al, 2018; Lennox et al, 2020). The absence of well-annotated corpora and NLP resources like parsers and taggers make Amharic sentiment analysis still challenging (Gezmu et al, 2018; Pandey and Govilkar, 2015)

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring Amharic Sentiment Analysis from Social Media Texts: Building Annotation Tools and Classification Models

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2020
Citations: 22	License type: cc-by

Similar Papers

Structured sentiment analysis in social media
Abdulqader Mohammed A Almars
-
Abdulqader Mohammed A AlmarsAbdulqader Mohammed A Almars
29 Nov 2019
29 Nov 2019

Sentiment Orientation from Code-mixed Social Media Data

International journal of next-generation computing | VOL. 12

03 Apr 2021
International journal of next-generation computing | VOL. 12

Sentiment analysis of comments in social media
Abdulrahman Alrumaih ... Ruaa Alsabah
International Journal of Electrical and Computer Engineering (IJECE) | VOL. 10
Abdulrahman Alrumaih, et. al.Abdulrahman Alrumaih ... Ruaa Alsabah
01 Dec 2020
International Journal of Electrical and Computer Engineering (IJECE) | VOL. 10

Sentiment Analysis of Noisy Malay Text: State of Art, Challenges and Future Work
Muhammad Fakhrur Razi Abu Bakar ... Liyana Shuib
IEEE Access | VOL. 8
Muhammad Fakhrur Razi Abu Bakar, et. al.Muhammad Fakhrur Razi Abu Bakar ... Liyana Shuib
01 Jan 2020
IEEE Access | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring Amharic Sentiment Analysis from Social Media Texts: Building Annotation Tools and Classification Models

Abstract

Highlights

Summary

Talk to us

Similar Papers