Iterative Ensemble Learning over High Dimensional Data for Sentiment Analysis

V R N S S V Saileela P Saileela P,N Naga Malleswara Rao

doi:10.12694/scpe.v25i2.2650

Abstract

For sentiment analysis in particular, the problem of processing and analyzing high-dimensional data becomes more prominent in recent past. This is where the IEL-HDDSA model, which aims to increase accuracy and performance in complex, high-dimensional data streams sentiment analysis comes into play. Iterative approach in ensemble learning; a contribution to the field. It integrates preprocessing techniques such as tokenization, stop word removal, lemmatization and the collection of sentiment-related features. Then the training corpus is divided by label, and features with high mutual information are selected. Highly replicated points of data for model training can also be identified at this point. First a Naive Bayes model is trained, then later it's placed in an ensemble as part of bagging. Its major advantage over earlier methods is that IEL-HDDSA can iteratively train on selected subsets of data until the performance in sentiment analysis for high-dimensional objects reaches an optimum level. A 10-fold cross validation method was used to rigorously evaluate the performance of this model, which showed consistently high levels of operation with almost no variation across different measures. IEL-HDDSA's precision ranged from 0.9359 to 0.9492, and its specificity was between 0. Its accuracy differed from 0.93 to around 0.95, and its F1-measure fluctuated between the values of about 0.94 and above; so here too balance was well maintained in a manner that satisfied both precision and recall requirements equally. The false alarming rate fell from 0.056 to 0.1, a fairly low ratio of incorrect positive classifications; Moreover, MCC quantities ranged from 0.8668 to 0. These results testify to the IEL-HDDSA model's stable effectiveness and high reproducibility in sentiment analysis applications, especially for massive data flows.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Iterative Ensemble Learning over High Dimensional Data for Sentiment Analysis

Abstract

Talk to us

Similar Papers

More From: Scalable Computing: Practice and Experience

Lead the way for us

Journal: Scalable Computing: Practice and Experience	Publication Date: Feb 24, 2024
License type: mit

Similar Papers

Big data and sentiment analysis: A comprehensive and systematic literature review
Mahdi Hajiali
Concurrency and Computation: Practice and Experience | VOL. 32
Mahdi HajialiMahdi Hajiali
19 Apr 2020
Concurrency and Computation: Practice and Experience | VOL. 32

Retrieval Information Using Generalized Vector Space Models And Sentiment Analysis Using Naïve Bayes Classifier For Evaluation Of Lecturers By Students
Suprianto ... Mussallimah
-
Suprianto, et. al. Suprianto ... Mussallimah
03 Nov 2020
03 Nov 2020

Online sentiment analysis in marketing research: a review
Meena Rambocas ... Barney G Pacheco
Journal of Research in Interactive Marketing | VOL. 12
Meena Rambocas, et. al.Meena Rambocas ... Barney G Pacheco
31 Jan 2018
Journal of Research in Interactive Marketing | VOL. 12

Predicting the customer’s opinion on amazon products using selective memory architecture-based convolutional neural network
Trupthi Mandhula ... Narsimha Gugulotu
The Journal of Supercomputing | VOL. 76
Trupthi Mandhula, et. al.Trupthi Mandhula ... Narsimha Gugulotu
19 Nov 2019
The Journal of Supercomputing | VOL. 76

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Iterative Ensemble Learning over High Dimensional Data for Sentiment Analysis

Abstract

Talk to us

Similar Papers

More From: Scalable Computing: Practice and Experience