On point estimation of the abnormality of a Mahalanobis index

Fadlalla G Elfadaly, Paul H Garthwaite, John R Crawford

doi:10.1016/j.csda.2016.01.014


Open Access

https://doi.org/10.1016/j.csda.2016.01.014


Abstract

Mahalanobis distance may be used as a measure of the disparity between an individual’s profile of scores and the average profile of a population of controls. The degree to which the individual’s profile is unusual can then be equated to the proportion of the population who would have a larger Mahalanobis distance than the individual. Several estimators of this proportion are examined. These include plug-in maximum likelihood estimators, medians, the posterior mean from a Bayesian probability matching prior, an estimator derived from a Taylor expansion, and two forms of polynomial approximation, one based on Bernstein polynomials and one on a quadrature method. Simulations show that some estimators, including the commonly used plug-in maximum likelihood estimators, can have substantial bias for small or moderate sample sizes. The polynomial approximations yield estimators that have low bias, with the quadrature method marginally preferable to Bernstein polynomials. However, the polynomial estimators sometimes yield infeasible estimates that are outside the 0–1 range. While none of the estimators is perfectly unbiased, the median estimators match their definition: in simulations, their estimates of the proportion have a median error close to zero. The standard median estimator can give unrealistically small estimates (including 0), and an adjustment is proposed that ensures estimates are always credible. This adjusted median estimator has much to recommend it when unbiasedness is not of paramount importance, while the quadrature method is recommended when bias is the dominant issue.

Highlights

The Mahalanobis distance is frequently used in multivariate analysis as a statistical measure of distance between a vector of scores for a single case and the mean vector of the underlying population or a sample of data. It was developed by Mahalanobis (1936) as a distance measure that incorporates the correlation between different scores.
We propose some alternative estimators of P and compare them in terms of their bias and root mean square error in a simulation study.
The third estimator in this group is a Bayesian estimator; it is based on the idea of probability matching priors and is denoted by PBY. We propose two further new estimators of P based on the mean of the non-centrality parameter of a non-central F distribution; these are denoted by PM and PR.

Summary

Introduction

The Mahalanobis distance is frequently used in multivariate analysis as a statistical measure of distance between a vector of scores for a single case and the mean vector of the underlying population or a sample of data. The commonly used estimates of P are the p-value computed from the chi-square distribution of the sample Mahalanobis index, or the p-value from the central F distribution associated with Hotelling’s T² test. In remote sensing image analysis, Foody (2006) was interested in measuring the closeness of an image pixel to a single class centroid. He computed the Mahalanobis distance of a particular image pixel from a specified class centroid and converted it to its associated p-value from the chi-square distribution.
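
The two commonly used estimates mentioned above can be sketched in a few lines. This is a minimal illustration, not the paper's code: it assumes multivariate normal controls, the sample covariance with divisor n − 1, and the standard single-case form of Hotelling's T² (T² = n/(n + 1) · D², with (n − k)/(k(n − 1)) · T² referred to a central F distribution on k and n − k degrees of freedom). The function name `plug_in_p_values` and its arguments are hypothetical, and NumPy and SciPy are assumed to be available.

```python
import numpy as np
from scipy import stats


def plug_in_p_values(controls, case):
    """Return (d2, p_chi2, p_f) for one case against a control sample.

    controls: array of shape (n, k), one row per control (hypothetical input).
    case:     array of shape (k,), the individual's profile of scores.
    """
    controls = np.asarray(controls, dtype=float)
    case = np.asarray(case, dtype=float)
    n, k = controls.shape
    if n <= k:
        raise ValueError("need more controls than variables (n > k)")

    mean = controls.mean(axis=0)
    # Sample covariance with divisor n - 1; atleast_2d covers the k = 1 case.
    cov = np.atleast_2d(np.cov(controls, rowvar=False))
    diff = case - mean

    # Sample Mahalanobis index D^2 = diff' S^{-1} diff.
    d2 = float(diff @ np.linalg.solve(cov, diff))

    # Estimate 1: upper-tail chi-square probability with k degrees of freedom.
    p_chi2 = float(stats.chi2.sf(d2, df=k))

    # Estimate 2: upper-tail central F probability from Hotelling's T^2.
    f_stat = d2 * n * (n - k) / ((n + 1) * k * (n - 1))
    p_f = float(stats.f.sf(f_stat, dfn=k, dfd=n - k))
    return d2, p_chi2, p_f


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    controls = rng.normal(size=(20, 4))       # simulated controls, illustration only
    case = np.array([1.5, -1.0, 2.0, 0.5])    # made-up case profile
    d2, p_chi2, p_f = plug_in_p_values(controls, case)
    print(f"D^2 = {d2:.3f}, chi-square p = {p_chi2:.4f}, F p = {p_f:.4f}")
```

Both values estimate the proportion of the control population with a more extreme profile than the case. The chi-square version treats the sample mean and covariance as if they were the population values, while the F version allows for their sampling variability.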

Two plug-in maximum likelihood estimators of P

Classical estimator of the median

Modified estimator of the median

Bayesian probability matching

Estimators based on the mean of λ

An estimator based on a Taylor expansion

Estimators based on polynomial approximations

Bernstein polynomials approximation

Quadrature polynomial approximation

Simulation results

Ranges of estimates

Performances as measured by absolute error

Findings

Concluding comments


Journal: Computational Statistics & Data Analysis
Publication Date: Jan 29, 2016
Citations: 31
License type: cc-by
