A Sentiment Classification in Bengali and Machine Translated English Corpus

Salim Sazzed,Sampath Jayarathna

doi:10.1109/iri.2019.00029

Abstract

The resource constraints in many languages have made the multi-lingual sentiment analysis approach a viable alternative for sentiment classification. Although a good amount of research has been conducted using a multi-lingual approach in languages like Chinese, Italian, Romanian, etc. very limited research has been done in Bengali. This paper presents a bilingual approach to sentiment analysis by comparing machine translated Bengali corpus to its original form. We apply multiple machine learning algorithms: Logistic Regression (LR), Ridge Regression (RR), Support Vector Machine (SVM), Random Forest (RF), Extra Randomized Trees (ET) and Long Short-Term Memory (LSTM) to a collection of Bengali corpus and corresponding machine translated English version. The results suggest that using machine translation improves classifiers performance in both datasets. Moreover, the results show that the unigram model performs better than higher-order n-gram model in both datasets due to linguistic variations and presence of misspelled words results from complex typing system of Bengali language; sparseness and noise in the machine translated data, and because of small datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Sentiment Classification in Bengali and Machine Translated English Corpus

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

URL-Based Sentiment Analysis of Product Reviews Using LSTM and GRU
Aakash ... Amandeep Noliya
Procedia Computer Science | VOL. 235
Aakash, et. al. Aakash ... Amandeep Noliya
01 Jan 2024
Procedia Computer Science | VOL. 235

Sentiment Analysis of Self Driving Car Dataset: A comparative study of Deep Learning approaches
Devshri Pandya ... Ankit Thakkar
Procedia Computer Science | VOL. 235
Devshri Pandya, et. al.Devshri Pandya ... Ankit Thakkar
01 Jan 2024
Procedia Computer Science | VOL. 235

Classification of Book Review Sentiment in Bangla Language Using NLP, Machine Learning and LSTM
Md Hamidur Rahman ... Md Mehedi Hasan
-
Md Hamidur Rahman, et. al.Md Hamidur Rahman ... Md Mehedi Hasan
06 Jul 2021
06 Jul 2021

Real-Time Twitter Spam Detection and Sentiment Analysis using Machine Learning and Deep Learning Techniques.
Anisha P Rodrigues ... Roshan Fernandes
Computational Intelligence and Neuroscience | VOL. 2022
Anisha P Rodrigues, et. al.Anisha P Rodrigues ... Roshan Fernandes
15 Apr 2022
Computational Intelligence and Neuroscience | VOL. 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Sentiment Classification in Bengali and Machine Translated English Corpus

Abstract

Talk to us

Similar Papers