Abstract

The enduring replication crisis in many scientific disciplines casts doubt on the ability of science to estimate effect sizes accurately, and in a wider sense, to self-correct its findings and to produce reliable knowledge. We investigate the merits of a particular countermeasure—replacing null hypothesis significance testing (NHST) with Bayesian inference—in the context of the meta-analytic aggregation of effect sizes. In particular, we elaborate on the advantages of this Bayesian reform proposal under conditions of publication bias and other methodological imperfections that are typical of experimental research in the behavioral sciences. Moving to Bayesian statistics would not solve the replication crisis single-handedly. However, the move would eliminate important sources of effect size overestimation for the conditions we study.

Highlights

  • In recent years, several scientific disciplines have been facing a replication crisis: researchers fail to reproduce the results of previous experiments when copying the original experimental design

  • Numerous authors identify “classical” statistical inference based on Null Hypothesis Significance Testing (NHST) as a major cause of the replication crisis (Cohen 1994; Goodman 1999a; Ioannidis 2005; Ziliak and McCloskey 2008) and suggest statistical reforms

  • While science most likely needs a combination of these reforms to improve (e.g., Ioannidis 2005; Romero 2019), in this paper we study the case for statistical reform and its interaction with various methodological limitations of scientific research


Summary

Introduction

Several scientific disciplines have been facing a replication crisis: researchers fail to reproduce the results of previous experiments when copying the original experimental design. We ask whether the replicability of published research would change if we replaced the conventional NHST method with Bayesian inference. To address this question, we conduct a systematic computer simulation study that investigates the self-corrective nature of science in the context of statistical inference. Since different statistical frameworks (e.g., NHST and Bayesian inference) classify the same set of experimental results into different qualitative categories, e.g., “strong evidence for the hypothesis”, “moderate evidence”, “inconclusive evidence”, etc., the dominant statistical framework affects the form and extent of publication bias. This, in turn, affects the accuracy of meta-analytic effect size estimates and the validity of the self-corrective thesis (SCT*).
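To make the mechanism concrete, the following minimal Python sketch (not part of the original study; the effect size, sample sizes, study count, and significance threshold are hypothetical) simulates many identical two-group experiments and compares the meta-analytic effect size estimate when all results are aggregated with the estimate obtained when only statistically significant results are "published":

```python
import numpy as np
from scipy import stats

# Minimal illustration of how a significance-based publication filter can inflate
# meta-analytic effect size estimates. All parameter values are hypothetical.
rng = np.random.default_rng(1)

TRUE_EFFECT = 0.2    # true standardized mean difference (Cohen's d)
N_PER_GROUP = 30     # per-group sample size of each simulated experiment
N_STUDIES = 5000     # number of simulated experiments

observed_d = np.empty(N_STUDIES)
p_values = np.empty(N_STUDIES)

for i in range(N_STUDIES):
    control = rng.normal(0.0, 1.0, N_PER_GROUP)
    treatment = rng.normal(TRUE_EFFECT, 1.0, N_PER_GROUP)
    pooled_sd = np.sqrt((control.var(ddof=1) + treatment.var(ddof=1)) / 2.0)
    observed_d[i] = (treatment.mean() - control.mean()) / pooled_sd
    p_values[i] = stats.ttest_ind(treatment, control).pvalue

# Unfiltered aggregation: averaging all studies recovers the true effect.
print(f"true effect:              {TRUE_EFFECT:.3f}")
print(f"mean d, all studies:      {observed_d.mean():.3f}")

# NHST-style publication bias: only significant results enter the meta-analysis,
# so the aggregated estimate is biased upward.
significant = p_values < 0.05
print(f"mean d, significant only: {observed_d[significant].mean():.3f}")
```

The sketch only illustrates the selection effect under a significance filter; the paper's simulations additionally model how Bayesian evidence categories would change which results are suppressed.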

NHST and Bayesian inference
Model description and simulation design
Variable 1: sufficient versus limited resources
Variable 2: direction bias
Variable 3: suppressing inconclusive evidence
Results: the baseline condition
Extension 1: the probabilistic file drawer effect
Extension 2: a wider range of effect sizes
Discussion
Findings
Compliance with ethical standards
