A novel feature selection framework for incomplete data

Cong Guo,Wei Yang,Zheng Li,Chun Liu

doi:10.1016/j.chemolab.2024.105193

Abstract

Feature selection on incomplete datasets is a challenging task. To address this challenge, existing methods first employ imputation methods to complete the dataset and then perform feature selection based on the imputed dataset. Since missing value imputation and feature selection are entirely independent, the importance of features cannot be considered during imputation. However, in real-world scenarios or datasets, different features have varying degrees of importance. To this end, we proposed a novel incomplete data feature selection framework that considers feature importance. The framework mainly consists of two alternating iterative stages: M-stage and W-stage. In the M-stage, missing values are imputed based on a given feature importance vector and multiple initial imputation results. In the W-stage, an improved reliefF algorithm is employed to learn the feature importance vector based on the imputed data. In particular, the feature importance output by the W-stage in the current iteration will be used as the input of the M-stage in the next iteration. Experimental results on artificial and real missing datasets demonstrate that the proposed method outperforms other approaches significantly.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A novel feature selection framework for incomplete data

Abstract

Talk to us

Similar Papers

More From: Chemometrics and Intelligent Laboratory Systems

Lead the way for us

Similar Papers

Temperature correction of near-infrared spectra of raw milk
Jose A Diaz-Olivares ... Ben Aernouts
Chemometrics and Intelligent Laboratory Systems | VOL. 255
Jose A Diaz-Olivares, et. al.Jose A Diaz-Olivares ... Ben Aernouts
18 Oct 2024
Chemometrics and Intelligent Laboratory Systems | VOL. 255

LTFM: Long-tail few-shot module with loose coupling strategy for mineral spectral identification
Youpeng Fan ... Yongchun Fang
Chemometrics and Intelligent Laboratory Systems | VOL. 254
Youpeng Fan, et. al.Youpeng Fan ... Yongchun Fang
15 Oct 2024
Chemometrics and Intelligent Laboratory Systems | VOL. 254

Recent applications of analytical quality-by-design methodology for chromatographic analysis: A review
Doan Thanh Xuan ... Vu Dang Hoang
Chemometrics and Intelligent Laboratory Systems | VOL. 254
Doan Thanh Xuan, et. al.Doan Thanh Xuan ... Vu Dang Hoang
10 Oct 2024
Chemometrics and Intelligent Laboratory Systems | VOL. 254

Layer-wise-residual-driven approach for soft sensing in composite dynamic system based on slow and fast time-varying latent variables
Zhengxuan Zhang ... Yuri A.W Shardt
Chemometrics and Intelligent Laboratory Systems | VOL. 254
Zhengxuan Zhang, et. al.Zhengxuan Zhang ... Yuri A.W Shardt
09 Oct 2024
Chemometrics and Intelligent Laboratory Systems | VOL. 254

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A novel feature selection framework for incomplete data

Abstract

Talk to us

Similar Papers

More From: Chemometrics and Intelligent Laboratory Systems