Abstract

Linear Discriminant Analysis (LDA) is the most commonly employed method for classification. This method which creates a linear discriminant function yields optimal classification rule between two or more groups under the assumptions of normality and homoscedasticity (equal covariance matrices). However, the calculation of parametric LDA highly relies on the sample mean vectors and pooled sample covariance matrix which are sensitive to non-normality. To overcome the sensitivity of this method towards non-normality as well as homoscedasticity, this study proposes two new robust LDA models. In these models, an automatic trimmed mean and its corresponding winsorized mean are employed to replace the mean vector in the parametric LDA. Meanwhile, for the covariance matrix, this study introduces two robust approaches namely the winsorization and the multiplication of Spearman's rho with the corresponding robust scale estimator used in the trimming process. Simulated and real financial data are used to test the performance of the proposed methods in terms of misclassification rate. The numerical result shows that the new method performs better if compared to the parametric LDA and the robust LDA with S-estimator. Thus, these new models can be recommended as alternatives to the parametric LDA when non-normality and heteroscedasticity (unequal covariance matrices) exist.

Highlights

  • Linear Discriminant Analysis (LDA) is one of the most widely used statistical approaches for analyzing attribute variables in supervised classification (Elizabeth and Andres, 2012)

  • We introduced two robust estimators namely the automatic trimmed mean, which is known as modified one-step M-estimator (MOM) and its winsorized version, referred to as winsorized modified one-step M-estimator (WMOM) to construct Robust Linear Discriminant Analysis (RLDA) model

  • The covariance matrix will be estimated using two approaches; the winsorized covariance and the product of spearman correlation coefficient and rescaled Median Absolute Deviation (MADn). These covariance matrices will be paired with the corresponding WMOM and MOM location estimates to form robust discriminant rule denoted as RLDAWM and RLDAM, respectively

Read more

Summary

Introduction

Linear Discriminant Analysis (LDA) is one of the most widely used statistical approaches for analyzing attribute variables in supervised classification (Elizabeth and Andres, 2012). The purpose of LDA is to determine which variable discriminates between two or more classes and to construct a classification model for predicting the group membership of new observations. Two approaches, namely trimming and winsorizing are proposed in the construction of new RLDA models to create discriminant rule that are robust to deviation.

Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.