How to Classify, Detect, and Manage Univariate and Multivariate Outliers, With Emphasis on Pre-Registration

Christophe Leys,Marie Delacre,Daniël Lakens,Youri L Mora,Christophe Ley

doi:10.5334/irsp.289

Christophe Leys, Marie Delacre + Show 3 more

Open Access

https://doi.org/10.5334/irsp.289

Copy DOI

Abstract

Researchers often lack knowledge about how to deal with outliers when analyzing their data. Even more frequently, researchers do not pre-specify how they plan to manage outliers. In this paper we aim to improve research practices by outlining what you need to know about outliers. We start by providing a functional definition of outliers. We then lay down an appropriate nomenclature/classification of outliers. This nomenclature is used to understand what kinds of outliers can be encountered and serves as a guideline to make appropriate decisions regarding the conservation, deletion, or recoding of outliers. These decisions might impact the validity of statistical inferences as well as the reproducibility of our experiments. To be able to make informed decisions about outliers you first need proper detection tools. We remind readers why the most common outlier detection methods are problematic and recommend the use of the median absolute deviation to detect univariate outliers, and of the Mahalanobis-MCD distance to detect multivariate outliers. An R package was created that can be used to easily perform these detection tests. Finally, we promote the use of pre-registration to avoid flexibility in data analysis when handling outliers.

Highlights

In other words: (1) we suggest collecting enough data so that removing outliers is possible without compromising the statistical power; (2) if outliers are believed to be random, it is acceptable to leave them as they are; (3) if, for pragmatic reasons, researchers are forced to keep outliers that they detected as outliers influenced by moderators, the Winsorization or other transformations are acceptable in order to avoid the loss of power
To face situations not envisaged in the pre-registration or to deal with instances where sticking to pre-registration seems erroneous, we propose three other options: 1) Asking judges blind to the research hypotheses to make a decision on whether outliers that do not correspond to the a priori decision criteria should be included
In this paper, we stressed the importance of outliers in several ways: to detect error outliers; to gain theoretical insights by identifying new moderators that can cause outlying values; and to improve the robustness of the statistical analyses

Summary

RESEARCH ARTICLE

How to Classify, Detect, and Manage Univariate and Multivariate Outliers, With Emphasis on Pre-Registration. The first is attractive for its simplicity: ‘Data values that are unusually large or small compared to the other values of the same construct’ (Aguinis et al 2013: 275, Table 1) This definition only applies to single constructs; researchers should consider multivariate outliers (i.e., outliers because of a surprising pattern across several variables). In a previous paper, Leys et al (2018) highlight a situation where outliers can be considered as heuristic tools, allowing researchers to gain insights regarding the processes under examination (see McGuire, 1997): ‘Consider a person who would exhibit a very high level of in-group identification but a very low level of prejudice towards a specific outgroup This would count as an outlier under the theory that group identification leads to prejudice towards relevant out-groups. The slope of the regression line can be computed as follows:

Yi Xi

Findings

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Review of Social Psychology	Publication Date: Apr 30, 2019
Citations: 165	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

How to Classify, Detect, and Manage Univariate and Multivariate Outliers, With Emphasis on Pre-Registration

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Review of Social Psychology

Lead the way for us

Similar Papers

A computer science odyssey.
Brooks Hanson ... Robert Coontz
Science (New York, N.Y.) | VOL. 293
Brooks Hanson, et. al.Brooks Hanson ... Robert Coontz
14 Sep 2001
Science (New York, N.Y.) | VOL. 293

Application of multivariate outlier detection to fluid velocity measurements
John Griffin ... Louis N Cattafesta
Experiments in Fluids | VOL. 49
John Griffin, et. al.John Griffin ... Louis N Cattafesta
14 Apr 2010
Experiments in Fluids | VOL. 49

The Redescending M estimator For detection and deletion of Outliers in Regression analysis
Stella Anekwe ... Sidney Onyeagu
Pakistan Journal of Statistics and Operation Research | VOL. -
Stella Anekwe, et. al.Stella Anekwe ... Sidney Onyeagu
02 Dec 2021
Pakistan Journal of Statistics and Operation Research | VOL. -

Multidimensional Signals and Analytic Flexibility: Estimating Degrees of Freedom in Human-Speech Analyses
...
Advances in Methods and Practices in Psychological Science | VOL. 6
, et. al. ...
01 Jul 2023
Advances in Methods and Practices in Psychological Science | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

How to Classify, Detect, and Manage Univariate and Multivariate Outliers, With Emphasis on Pre-Registration

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Review of Social Psychology