WITHDRAWN: SMO-RF:A machine learning approach by random forest for predicting class imbalancing followed by SMOTE

Ankur Goyal,Likhita Rathore,Avinash Sharma

doi:10.1016/j.matpr.2020.12.891

Abstract

In different fields, such as machine learning and data mining, class imbalances have been one of the most complex issues for the past few decades. The unique condition of an imbalanced dataset that distributes each class of a particular dataset unevenly. The positive class is slightly smaller than the negative one. Many standard classification algorithms in this case do not classify instances related to the positive class. Typically the main goal of the classification task is a positive class. To deal with this problem, several approaches were proposed, for example sampling dependent over-sampling, undersampling, classification level enhancements, or the combination of two or more classifiers. The major problem however is that most solutions have a negative class, a computational cost, a storage problem, or a long training period. Data upsampling or downsampling may resolve a possible solution to the issue of skewness of data. In this paper, a hybrid technique is presented, followed by a random forest algorithm (SMO-RF), to categorized binary imbalanced data using the Technique of Synthetic Minority Oversampling. We have tested our model with four standard imbalanced datasets and obtained a higher F-measure, G-mean as well as ROC values for all data sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

WITHDRAWN: SMO-RF:A machine learning approach by random forest for predicting class imbalancing followed by SMOTE

Abstract

Talk to us

Similar Papers

More From: Materials Today: Proceedings

Lead the way for us

Journal: Materials Today: Proceedings	Publication Date: Feb 1, 2021
Citations: 4

Similar Papers

A Classification Model For Class Imbalance Dataset Using Genetic Programming
Mirza Amaad Ul Haq Tahir ... Awais Manzoor
IEEE Access | VOL. 7
Mirza Amaad Ul Haq Tahir, et. al.Mirza Amaad Ul Haq Tahir ... Awais Manzoor
01 Jan 2019
IEEE Access | VOL. 7

Comparing SVM ensembles for imbalanced datasets
Vasudha Bhatnagar ... Manju Bhardwaj
-
Vasudha Bhatnagar, et. al.Vasudha Bhatnagar ... Manju Bhardwaj
01 Nov 2010
01 Nov 2010

ForesTexter: An efficient random forest algorithm for imbalanced text categorization
Qingyao Wu ... Shen-Shyang Ho
Knowledge-Based Systems | VOL. 67
Qingyao Wu, et. al.Qingyao Wu ... Shen-Shyang Ho
19 Jun 2014
Knowledge-Based Systems | VOL. 67

Imbalance Learning and Its Application on Medical Datasets
Yachao Shao
-
Yachao ShaoYachao Shao
21 Feb 2022
21 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

WITHDRAWN: SMO-RF:A machine learning approach by random forest for predicting class imbalancing followed by SMOTE

Abstract

Talk to us

Similar Papers

More From: Materials Today: Proceedings