BALANCING SARCASTIC HINGLISH SHORT TEXT DATA USING AUGMENTATION TECHNIQUES WITH HANDLING SPELLING VARIATIONS

Rajshree Singh

doi:10.52783/jes.3602

Abstract

Real-world datasets are often imbalanced because their classes are not evenly distributed. Even with traditional class-balancing methods such as re-sampling and re-weighting, class imbalance remains a significant obstacle for current deep learning. The major objective of this study is to propose a data augmentation technique that balances the data by increasing the sample sizes of the minority classes. The study is implemented in Python using multiple machine learning methods. The classification models used are Logistic Regression, Naïve Bayes, Support Vector Machine, Decision Tree, Random Forest, Extra Trees Classifier, AdaBoost classifier, and Gradient Boost classifier. Precision, recall, and F-score were used to determine which model was the most effective. According to the study's analysis, the Naïve Bayes approach, with an F1-score of 95.85% and parameters Wn = 3, Cn = 3, and CWn = 3, yields the best results.
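The general idea of balancing a dataset through augmentation and then scoring a classifier with precision, recall, and F1 can be sketched as follows. This is only an illustrative sketch and not the augmentation technique proposed in the paper: the word-swap augmenter, the function names (`swap_augment`, `balance_by_augmentation`, `precision_recall_f1`), and the toy Hinglish sentences are all assumptions made up for this example.

```python
import random
from collections import Counter


def swap_augment(text, rng):
    """Return a copy of `text` with two randomly chosen words swapped.

    Hypothetical stand-in augmenter; the paper's actual technique is not
    described in the abstract.
    """
    words = text.split()
    if len(words) < 2:
        return text
    i, j = rng.sample(range(len(words)), 2)
    words[i], words[j] = words[j], words[i]
    return " ".join(words)


def balance_by_augmentation(texts, labels, seed=0):
    """Append augmented minority-class samples until all classes are equal in size."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_texts, out_labels = list(texts), list(labels)
    for label, count in counts.items():
        # Only original samples of this class are used as augmentation sources.
        pool = [t for t, l in zip(texts, labels) if l == label]
        for _ in range(target - count):
            out_texts.append(swap_augment(rng.choice(pool), rng))
            out_labels.append(label)
    return out_texts, out_labels


def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall, and F1 for the `positive` class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy data (invented): label 1 = sarcastic (minority), label 0 = not sarcastic.
texts = [
    "aaj mausam accha hai",
    "bahut accha kaam kiya",
    "kal milte hain office mein",
    "wah kya timing hai boss",
]
labels = [0, 0, 0, 1]

balanced_texts, balanced_labels = balance_by_augmentation(texts, labels)
class_counts = Counter(balanced_labels)
```

After balancing, `class_counts` holds the same number of samples for each class, and the balanced set could then be passed to any of the classifiers listed above. `precision_recall_f1` shows how the three evaluation metrics relate to true positives, false positives, and false negatives.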

Journal: Journal of Electrical Systems
Publication Date: May 4, 2024
License type: CC BY-ND 4.0

