Improving covariance-regularized discriminant analysis for EHR-based predictive analytics of diseases

Sijia Yang,Jiang Bian,Zeyi Sun,Haoyi Xiong,Kaibo Xu,Licheng Wang

doi:10.1007/s10489-020-01810-4

Abstract

Linear Discriminant Analysis (LDA) is a well-known technique for feature extraction and dimension reduction. The performance of classical LDA however, significantly degrades on the High Dimension Low Sample Size (HDLSS) data for the ill-posed inverse problem. Existing approaches for HDLSS data classification typically assume the data in question are with Gaussian distribution and deal the HDLSS classification problem with regularization. However, these assumptions are too strict to hold in many emerging real-life applications, such as enabling personalized predictive analysis using Electronic Health Records (EHRs) data collected from an extremely limited number of patients who have been diagnosed with or without the target disease for prediction. In this paper, we revised the problem of predictive analysis of disease using personal EHR data and LDA classifier. To fill the gap, in this paper, we first studied an analytical model that understands the accuracy of LDA for classifying data with arbitrary distribution. The model gives a theoretical upper bound of LDA error rate that is controlled by two factors: (1) the statistical convergence rate of (inverse) covariance matrix estimators and (2) the divergence of the training/testing datasets to fitted distributions. To this end, we could lower the error rate by balancing the two factors for better classification performance. Hereby, we further proposed a novel LDA classifier De-Sparse that leverages De-sparsified Graphical Lasso to improve the estimation of LDA, which outperforms state-of-the-art LDA approaches developed for HDLSS data. Such advances and effectiveness are further demonstrated by both theoretical analysis and extensive experiments on EHR datasets https://www.overleaf.com/project/5d2728c718f6ff3b2bcf5991 .

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving covariance-regularized discriminant analysis for EHR-based predictive analytics of diseases

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence

Lead the way for us

Journal: Applied Intelligence	Publication Date: Aug 13, 2020
Citations: 6

Similar Papers

DBSDA : Lowering the Bound of Misclassification Rate for Sparse Linear Discriminant Analysis via Model Debiasing.
Haoyi Xiong ... Jiang Bian
IEEE transactions on neural networks and learning systems | VOL. 30
Haoyi Xiong, et. al.Haoyi Xiong ... Jiang Bian
24 Jul 2018
IEEE transactions on neural networks and learning systems | VOL. 30

Daehr
Haoyi Xiong ... Jinghe Zhang
ACM Transactions on Intelligent Systems and Technology | VOL. 8
Haoyi Xiong, et. al.Haoyi Xiong ... Jinghe Zhang
08 Feb 2017
ACM Transactions on Intelligent Systems and Technology | VOL. 8

Evaluation of Electronic Health Record and Long-Term Care Pharmacy Data for Tracking and Reporting Antibiotic Use in the United States
Matthew Hudson ... Nancy Chi
Antimicrobial Stewardship & Healthcare Epidemiology | VOL. 1
Matthew Hudson, et. al.Matthew Hudson ... Nancy Chi
01 Jul 2021
Antimicrobial Stewardship & Healthcare Epidemiology | VOL. 1

High dimensional low sample size activity recognition using geometric classifiers
Muhammad Shahzad Cheema ... Christian Bauckhage
Digital Signal Processing | VOL. 42
Muhammad Shahzad Cheema, et. al.Muhammad Shahzad Cheema ... Christian Bauckhage
22 Apr 2015
Digital Signal Processing | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving covariance-regularized discriminant analysis for EHR-based predictive analytics of diseases

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence