H‐type indices with applications in chemometrics II: h‐outlyingness index

Qin Yang,Guo‐Li Tian,Lu Xu,Ben‐Qing Wu

doi:10.1002/cem.3375

Abstract

AbstractAn outlier is generally considered as a data point that deviates from the “bulk” of all the data points. For outlier diagnosis, two questions could be asked: (1) How far is an object from the bulk? and (2) how many data points do the “bulk” include? To simultaneously deal with the above two questions, the h‐outlyingness index (HOI) is defined as suppose a given data point in a data set of N data points, if at most M% of all the (N − 1) one‐to‐rest distances is no less than M% of all the N(N − 1)/2 pairwise distances, the HOI value for the given data point will be M%. For applications, HOI was used for outlier diagnosis in simulated and real data sets, and the results were compared with those obtained by some robust statistical methods. Compared with the traditional methods, HOI gained similar results. For high‐dimensional data, it was wise to compute HOI based on dimension reduction methods such as principal component analysis (PCA). HOI was demonstrated to be a simple, easy‐to‐compute, robust and effective index for outlier diagnosis. Moreover, HOI is a nonparametric method that has no underlying assumptions on data distribution, which will be useful in chemometrics for multivariate outlier diagnosis.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

H‐type indices with applications in chemometrics II: h‐outlyingness index

Abstract

Talk to us

Similar Papers

More From: Journal of Chemometrics

Lead the way for us

Similar Papers

Data Visualization by Pairwise Distortion Minimization
Marc Sobel ... Longin Jan Latecki
Communications in Statistics - Theory and Methods | VOL. 34
Marc Sobel, et. al.Marc Sobel ... Longin Jan Latecki
01 Jun 2005
Communications in Statistics - Theory and Methods | VOL. 34

Feature Dimensionality Reduction with Graph Embedding and Generalized Hamming Distance
Honglei Zhang ... Mancef Gabbouj
-
Honglei Zhang, et. al.Honglei Zhang ... Mancef Gabbouj
01 Oct 2018
01 Oct 2018

Quantifying the relative importance of experimental data points in parameter estimation
Jenny E Jeong ... Peng Qiu
BMC Systems Biology | VOL. 12
Jenny E Jeong, et. al.Jenny E Jeong ... Peng Qiu
01 Nov 2018
BMC Systems Biology | VOL. 12

Accumulated Relative Density Outlier Detection For Large Scale Traffic Data
Sophia W.T.T Liu ... Steven J Simske
Electronic Imaging | VOL. 30
Sophia W.T.T Liu, et. al.Sophia W.T.T Liu ... Steven J Simske
28 Jan 2018
Electronic Imaging | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

H‐type indices with applications in chemometrics II: h‐outlyingness index

Abstract

Talk to us

Similar Papers

More From: Journal of Chemometrics