AI federated learning based improvised random Forest classifier with error reduction mechanism for skewed data sets

Anjali More,Dipti Rana

doi:10.1108/ijpcc-02-2022-0034

Abstract

PurposeReferred data set produces reliable information about the network flows and common attacks meeting with real-world criteria. Accordingly, this study aims to focus on the use of imbalanced intrusion detection benchmark knowledge discovery in database (KDD) data set. KDD data set is most preferably used by many researchers for experimentation and analysis. The proposed algorithm improvised random forest classification with error tuning factors (IRFCETF) deals with experimentation on KDD data set and evaluates the performance of a complete set of network traffic features through IRFCETF.Design/methodology/approachIn the current era of applications, the attention of researchers is immersed by a diverse number of existing time applications that deals with imbalanced data classification (ImDC). Real-time application areas, artificial intelligence (AI), Industrial Internet of Things (IIoT), etc. are dealing ImDC undergo with diverted classification performance due to skewed data distribution (SkDD). There are numerous application areas that deal with SkDD. Many of the data applications in AI and IIoT face the diverted data classification rate in SkDD. In recent advancements, there is an exponential expansion in the volume of computer network data and related application developments. Intrusion detection is one of the demanding applications of ImDC. The proposed study focusses on imbalanced intrusion benchmark data set, KDD data set and other benchmark data set with the proposed IRFCETF approach. IRFCETF justifies the enriched classification performance on imbalanced data set over the existing approach. The purpose of this work is to review imbalanced data applications in numerous application areas including AI and IIoT and tuning the performance with respect to principal component analysis. This study also focusses on the out-of-bag error performance-tuning factor.FindingsExperimental results on KDD data set shows that proposed algorithm gives enriched performance. For referred intrusion detection data set, IRFCETF classification accuracy is 99.57% and error rate is 0.43%.Research limitations/implicationsThis research work extended for further improvements in classification techniques with multiple correspondence analysis (MCA); hierarchical MCA can be focussed with the use of classification models for wide range of skewed data sets.Practical implicationsThe metrics enhancement is measurable and helpful in dealing with intrusion detection systems–related imbalanced applications in current application domains such as security, AI and IIoT digitization. Analytical results show improvised metrics of the proposed approach than other traditional machine learning algorithms. Thus, error-tuning parameter creates a measurable impact on classification accuracy is justified with the proposed IRFCETF.Social implicationsProposed algorithm is useful in numerous IIoT applications such as health care, machinery automation etc.Originality/valueThis research work addressed classification metric enhancement approach IRFCETF. The proposed method yields a test set categorization for each case with error reduction mechanism.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AI federated learning based improvised random Forest classifier with error reduction mechanism for skewed data sets

Abstract

Talk to us

Similar Papers

More From: International Journal of Pervasive Computing and Communications

Lead the way for us

Similar Papers

A Novel Multi-class Classification Architecture Combining Population-based Sampling and Multi-expert Classifier for Imbalanced Data
Haochen Jiang ... Jun Chen
-
Haochen Jiang, et. al.Haochen Jiang ... Jun Chen
17 Oct 2021
17 Oct 2021

Deep Learning for Imbalanced Multimedia Data Classification
Yilin Yan ... Min Chen
-
Yilin Yan, et. al.Yilin Yan ... Min Chen
01 Dec 2015
01 Dec 2015

Learning from Imbalanced Multi-label Data Sets by Using Ensemble Strategies
Fatemeh Shamsezat ... Mohammad Masoud Javidi
Computer Engineering and Applications Journal | VOL. 4
Fatemeh Shamsezat, et. al.Fatemeh Shamsezat ... Mohammad Masoud Javidi
18 Feb 2015
Computer Engineering and Applications Journal | VOL. 4

Artificial Intelligence and Machine Learning for the Industrial Internet of Things (IIoT)
Fanoon Raheem ... Nihla Iqbal
-
Fanoon Raheem, et. al.Fanoon Raheem ... Nihla Iqbal
17 Feb 2022
17 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AI federated learning based improvised random Forest classifier with error reduction mechanism for skewed data sets

Abstract

Talk to us

Similar Papers

More From: International Journal of Pervasive Computing and Communications