Nonlinear Random Forest Classification, a Copula-Based Approach

Radko Mesiar,Ayyub Sheikhi

doi:10.3390/app11157140

Abstract

In this work, we use a copula-based approach to select the most important features for a random forest classification. Based on associated copulas between these features, we carry out this feature selection. We then embed the selected features to a random forest algorithm to classify a label-valued outcome. Our algorithm enables us to select the most relevant features when the features are not necessarily connected by a linear function; also, we can stop the classification when we reach the desired level of accuracy. We apply this method on a simulation study as well as a real dataset of COVID-19 and for a diabetes dataset.

Highlights

Dimension reduction is a major area of interest within the field of data mining and knowledge discovery, especially in high-dimensional analysis
A copula-based algorithm has been employed in a random forest classification
The idea of this paper may be extended in some manners. One may use this idea in a multi-class random forest classification

Summary

Introduction

Dimension reduction is a major area of interest within the field of data mining and knowledge discovery, especially in high-dimensional analysis. The issue of machine learning has received considerable attention; a number of researchers have sought to perform more accurate dimension reductions in this issue [1,2]. There are many areas of statistics and machine learning that benefit from feature selection techniques. From the statistics point of view, Han and Liu et al (2013) [3] and Basabi (2008) [4] have applied feature selection for multivariate time series. Debashis et al (2008) [5] have investigated feature selection and regression in high-dimensional problems

Objectives

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Aug 2, 2021
Citations: 14	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Nonlinear Random Forest Classification, a Copula-Based Approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Random forest classification of Callicarpa nudiflora from WorldView-3 imagery based on optimized feature space
Ting-Ting Shi ... Lu-Qi Huang
Zhongguo Zhong yao za zhi = Zhongguo zhongyao zazhi = China journal of Chinese materia medica | VOL. 44
Ting-Ting Shi, et. al.Ting-Ting Shi ... Lu-Qi Huang
01 Oct 2019
Zhongguo Zhong yao za zhi = Zhongguo zhongyao zazhi = China journal of Chinese materia medica | VOL. 44

A study on the classification of vegetation point cloud based on random forest in the straw checkerboard barriers area
Tiebo Sun ... Tingting Sui
Journal of Intelligent & Fuzzy Systems | VOL. 41
Tiebo Sun, et. al.Tiebo Sun ... Tingting Sui
01 Jan 2020
Journal of Intelligent & Fuzzy Systems | VOL. 41

An improved binary manta ray foraging optimization algorithm based feature selection and random forest classifier for network intrusion detection
Ibrahim Hayatu Hassan ... Sahabi Ali Yusuf
Intelligent Systems with Applications | VOL. 16
Ibrahim Hayatu Hassan, et. al.Ibrahim Hayatu Hassan ... Sahabi Ali Yusuf
01 Nov 2022
Intelligent Systems with Applications | VOL. 16

Employing cluster-based class decomposition approach to detect phishing websites using machine learning classifiers
Yousif Al-Tamimi ... Mohammad Shkoukani
International Journal of Data and Network Science | VOL. 7
Yousif Al-Tamimi, et. al.Yousif Al-Tamimi ... Mohammad Shkoukani
01 Jan 2023
International Journal of Data and Network Science | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Nonlinear Random Forest Classification, a Copula-Based Approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences