Abstract

This article explores whether the null hypothesis significance testing (NHST) framework provides a sufficient basis for the evaluation of statistical model assumptions. It is argued that while NHST-based tests can provide some degree of confirmation for the model assumption that is evaluated (formulated as the null hypothesis), these tests do not inform us of the degree of support that the data provide for the null hypothesis, nor of the extent to which the null hypothesis should be considered plausible once the data have been taken into account. Addressing the prior plausibility of the model assumption is unavoidable if the goal is to determine how plausible it is that the assumption holds. Without assessing the prior plausibility of the model assumptions, it remains entirely unclear whether the model of interest gives an adequate description of the data and thus whether it can be considered valid for the application at hand. Although addressing the prior plausibility is difficult, ignoring it is not an option if we want to claim that the inferences of our statistical model can be relied upon.
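
As a minimal sketch of the underlying point (not taken from the article itself), Bayes' theorem shows why the posterior plausibility of a model assumption H0 cannot be obtained from the data alone: it also requires the prior plausibility Pr(H0) and an alternative H1 against which the data are weighed.

    \[
    \Pr(H_0 \mid \mathrm{data})
      = \frac{\Pr(\mathrm{data} \mid H_0)\,\Pr(H_0)}
             {\Pr(\mathrm{data} \mid H_0)\,\Pr(H_0) + \Pr(\mathrm{data} \mid H_1)\,\Pr(H_1)}
    \]

A p-value from an NHST-based assumption check is computed under H0 alone and corresponds to none of the terms on the right-hand side; in particular, it says nothing about the prior plausibility Pr(H0), which is why that prior cannot be sidestepped.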

Highlights

  • One of the core objectives of the social sciences is to critically evaluate their theories on the basis of empirical observations

  • The basis of the null hypothesis significance testing (NHST) framework goes back to the statistical paradigm founded by Fisher in the 1930s (Fisher, 1930, 1955, 1956, 1960; Hacking, 1976; Gigerenzer, 1993), as well as the statistical paradigm founded by Neyman and Pearson in that same period (Neyman, 1937, 1957; Pearson, 1955; Neyman & Pearson, 1967; Hacking, 1976; Gigerenzer, 1993)

  • Starting from the 1950s, elements from both approaches were incorporated in the hybrid NHST framework as it exists today in the social and behavioral sciences (Gigerenzer & Murray, 1987; Gigerenzer, Swijtink, Porter, Daston, Beatty, & Kruger, 1989; Gigerenzer, 1993; Lehmann, 2006): While this framework proposes to evaluate a null hypothesis in contrast to an alternative hypothesis—in line with Neyman and Pearson—the focus lies on attempting to reject the null hypothesis, in line with Fisher’s methodology

Summary

Introduction

One of the core objectives of the social sciences is to critically evaluate their theories on the basis of empirical observations. The NHST framework has, however, been heavily criticized, with alternatives such as confidence intervals (Cumming & Finch, 2005; Fidler & Loftus, 2009; Cumming, 2014) or Bayesian approaches (Wagenmakers, 2007) being proposed as more informative and more appropriate tools for statistical inference. Although these criticisms have been unanimous in their rejection of the use of (solely) NHST-based methods for evaluating substantive hypotheses and have had an important impact on both recommended and actual statistical practices, the uncritical and inappropriate use of NHST still appears to be quite common. Standard methods for assessing whether statistical model assumptions hold often rely on NHST as well (e.g., Tabachnick & Fidell, 2001; Field, 2009), and are employed even when the substantive analyses do not use NHST (e.g., item response theory; Lord, 1980). Similar to the way in which Popper suggested evaluating theories (Popper, 1959), null hypotheses are treated as hypotheses that have not yet been successfully rejected, but that should not therefore be considered likely to be true.
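
As an illustration of how such NHST-based assumption checks are typically carried out, the sketch below (in Python, with hypothetical data; not code from the article) runs a Shapiro-Wilk test of normality with scipy. Note the asymmetry that the article targets: a small p-value counts against the assumption, whereas a large p-value only means the assumption has not been rejected, not that it has been shown to be plausible.

    # Hedged sketch: a typical NHST-based check of a normality assumption.
    # A non-significant result does not tell us how plausible normality is;
    # it only says the data were not extreme enough to reject it.
    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(1)               # hypothetical data, for illustration only
    residuals = rng.standard_t(df=10, size=40)   # mildly heavy-tailed "residuals"

    w_stat, p_value = stats.shapiro(residuals)   # Shapiro-Wilk test of normality

    if p_value < 0.05:
        print(f"p = {p_value:.3f}: normality rejected at the 5% level")
    else:
        # The Popperian reading: 'not yet rejected', which is not the same
        # as 'shown to be plausible'.
        print(f"p = {p_value:.3f}: normality not rejected")

The same asymmetry holds for any NHST-based assumption check: the p-value is calibrated for rejecting the assumption at a chosen error rate, not for quantifying the support the data lend to it.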

