Improving Penalized Logistic Regression Model with Missing Values in High-Dimensional Data

Aiedh Mrisi Alharthi,Zakariya Yahya Algamal,Muhammad Hisyam Lee

doi:10.3991/ijoe.v18i02.25047

Abstract

Analysis without adequate handling of missing values may lead to inconsistent and biased estimates. Despite multiple imputations becoming a widely used approach in handling missing data, manuscript researchers generally encounter missing data in their respective studies. In high-dimensional data, penalized regression is a popular technique for performing feature selection and coefficient estimation simultaneously. However, one of the most vital issues with high-dimensional data is that it often contains large quantities of missing data that common multiple imputation approaches may not work correctly. Therefore, this study uses imputations penalized regression models as an extension of the penalized methods to improve the performance and impute missing values in high-dimensional data. The method was applied to real-life high-dimensional datasets for the different number of features, sample sizes, and missing dataset rates to evaluate its eﬃciency. The method was also compared with other existing imputation penalized methods for high-dimensional data. The comparative experimental results indicate that the proposed method outperforms its competitors by achieving higher sensitivity, specificity, and classification accuracy values.

Highlights

Missing data exist in almost all areas of biomedical, epidemiological, and social research
There has been significant progress in the methods and tools for variable selection, missing data often occurs in extensive, complicated research and which can make data analysis challenging
It is mainly focused on improving the performance of penalized logistic regression models and handling missing values in high-dimensional data through the imputations adaptive penalized logistic regression (IAPLR) method

Summary

Introduction

Missing data exist in almost all areas of biomedical, epidemiological, and social research. Many statistical techniques often require complete cases without any missing data. This as inaccurate estimates and conclusions may result from an analysis that does not properly handle missing values [2]. Delete a high number of observations with missing values, on the other hand, results in a considerable loss of data [3], [4]. It has a negative impact on the data's statistical power and efficiency [5]. To overcome the missing values in high-dimensional data, reliable imputation approaches are required

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving Penalized Logistic Regression Model with Missing Values in High-Dimensional Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Online and Biomedical Engineering (iJOE)

Lead the way for us

Journal: International Journal of Online and Biomedical Engineering (iJOE)	Publication Date: Feb 16, 2022
License type: CC BY 4.0

Similar Papers

What is missing from my missing data plan?
Sharon D Yeatts ... Renée H Martin
Stroke | VOL. 46
Sharon D Yeatts, et. al.Sharon D Yeatts ... Renée H Martin
07 May 2015
Stroke | VOL. 46

Comparison of techniques for handling missing covariate data within prognostic modelling studies: a simulation study
Andrea Marshall ... Patrick Royston
BMC Medical Research Methodology | VOL. 10
Andrea Marshall, et. al.Andrea Marshall ... Patrick Royston
19 Jan 2010
BMC Medical Research Methodology | VOL. 10

An ensemble learning method for variable selection: application to high-dimensional data and missing values
Avner Bar-Hen ... Vincent Audigier
Journal of Statistical Computation and Simulation | VOL. ahead-of-print
Avner Bar-Hen, et. al.Avner Bar-Hen ... Vincent Audigier
07 May 2022
Journal of Statistical Computation and Simulation | VOL. ahead-of-print

Comparison of Outcomes for Children With Cervical Spine Injury Based on Destination Hospital From Scene of Injury
Jennifer F Anders ... Kathleen Adelgais
Academic Emergency Medicine | VOL. 21
Jennifer F Anders, et. al.Jennifer F Anders ... Kathleen Adelgais
01 Jan 2014
Academic Emergency Medicine | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving Penalized Logistic Regression Model with Missing Values in High-Dimensional Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Online and Biomedical Engineering (iJOE)