Abstract

Abstract Outlier detection has become increasingly important for high dimensional data (HDD) before data analysis. Although the multivariate outlier detection methods (ODMs) have been extensively studied in the research literature, there are few ODMs for ODMs when a correlation exists between data’s components. In this paper, we introduce a partition-based estimator of the minimum covariance determinant for HDD, along with a rapid algorithm designed to identify the clean subset. Additionally, an OD threshold is employed by examining the asymptotic distribution of the outlying measure distance to pinpoint outliers. Moreover, a one-step reweighting scheme is introduced to enhance the detection efficiency of the process. At last, simulation results demonstrate that the proposed RPMCD methods exhibit superior finite sample performance compared to other existing techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.