Cost-Sensitive Feature Selection for Class Imbalance Problem

Małgorzata Bach,Aleksandra Werner

doi:10.1007/978-3-319-67220-5_17

Abstract

The class imbalance problem is encountered in real-world applications of machine learning and results in suboptimal performance during data classification. This is especially true when data is not only imbalanced but also high dimensional. The class imbalance is very often accompanied by a high dimensionality of datasets and in such a case these problems should be considered together. Traditional feature selection methods usually assign the same weighting to samples from different classes when the samples are used to evaluate each feature. Therefore, they do not work good enough with imbalanced data. In situation when the costs of misclassification of different classes are diverse, cost-sensitive learning methods are often applied. These methods are usually used in the classification phase, but we propose to take the cost factors into consideration during the feature selection. In this study we analyse whether the use of cost-sensitive feature selection followed by resampling can give good results for mentioned problems. To evaluate tested methods three imbalanced and multidimensional datasets are considered and the performance of chosen feature selection methods and classifiers are analysed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Cost-Sensitive Feature Selection for Class Imbalance Problem

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Cost-Sensitive Feature Selection via F-Measure Optimization Reduction
Meng Liu ... Yong Luo
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 31
Meng Liu, et. al.Meng Liu ... Yong Luo
13 Feb 2017
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 31

Feature selection for high dimensional imbalanced class data based on F-measure optimization
Chunkai Zhang ... Ying Zhou
-
Chunkai Zhang, et. al.Chunkai Zhang ... Ying Zhou
01 Dec 2017
01 Dec 2017

Two-Stage Cost-Sensitive Learning for Software Defect Prediction
Mingxia Liu ... Linsong Miao
IEEE Transactions on Reliability | VOL. 63
Mingxia Liu, et. al.Mingxia Liu ... Linsong Miao
01 Jun 2014
IEEE Transactions on Reliability | VOL. 63

Solving the class imbalance problem using ensemble algorithm: application of screening for aortic dissection
Lijue Liu ... Shiyang Tan
BMC Medical Informatics and Decision Making | VOL. 22
Lijue Liu, et. al.Lijue Liu ... Shiyang Tan
28 Mar 2022
BMC Medical Informatics and Decision Making | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cost-Sensitive Feature Selection for Class Imbalance Problem

Abstract

Talk to us

Similar Papers