Abstract

The use of big data in companies is currently used in file processing. With large capacity files, it can affect the performance in terms of time in the company, so to overcome the problem of high-dimensional data, feature selection is used in selecting the number of features. On the WDC dataset with 30 features and 569 data points, feature selection is performed using the Recusive Feature Elimination (RFE) and Genetic Algorithm (GA) models. Then a comparison of evaluation values is made to determine which feature selection is best for solving the problem. From the 14 tables of evaluation results and discussion in tables 1 to 14, it is found that in the evaluation of accuracy and the use of weighted macros on precision, recall, and f1 score, using GA selection features has slightly higher results than RFE, so it is concluded that GA selection features are better at solving problems in high-dimensional data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call