Abstract

Healthcare professionals increasingly rely on observational healthcare data, such as administrative claims and electronic health records, to estimate the causal effects of interventions. However, the limited prior research that exists raises concerns about the real-world performance of the statistical and epidemiological methods used in these studies. We present the "OHDSI Methods Benchmark", which aims to evaluate the performance of effect estimation methods on real data. The benchmark comprises a gold standard, a set of metrics, and a set of open source software tools. The gold standard is a collection of real negative controls (drug-outcome pairs where no causal effect appears to exist) and synthetic positive controls (drug-outcome pairs that augment negative controls with simulated causal effects). We apply the benchmark using four large healthcare databases to evaluate methods commonly used in practice: the new-user cohort, self-controlled cohort, case-control, case-crossover, and self-controlled case series designs. The results confirm these concerns: for most methods, operating characteristics deviate considerably from nominal levels. For example, in most contexts, only half of the 95% confidence intervals we calculated contain the corresponding true effect size. We previously developed an "empirical calibration" procedure to restore these operating characteristics, and we evaluate that procedure here as well. While no single method dominates, self-controlled methods such as the empirically calibrated self-controlled case series perform well across a wide range of scenarios.
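
The coverage metric and the calibration procedure mentioned above can be sketched in a few lines of code. The following is a minimal, hypothetical Python illustration, not the benchmark's own implementation (the benchmark ships as open source tools); all function names here are ours. The calibration sketch assumes the common formulation of empirical p-value calibration: fit a normal systematic-error distribution to the negative-control estimates, then test new estimates against that empirical null.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

def ci_coverage(log_rr, se, true_log_rr, level=0.95):
    """Fraction of confidence intervals that contain the true effect size."""
    z = norm.ppf(0.5 + level / 2.0)              # about 1.96 for a 95% interval
    lower, upper = log_rr - z * se, log_rr + z * se
    return np.mean((lower <= true_log_rr) & (true_log_rr <= upper))

def fit_systematic_error(nc_log_rr, nc_se):
    """Fit a normal systematic-error distribution N(mu, sigma^2) to
    negative-control estimates. Their true log(RR) is 0, so each observed
    estimate is modeled as N(mu, sigma^2 + se_i^2), which accounts for
    each control's own sampling error."""
    nc_log_rr, nc_se = np.asarray(nc_log_rr), np.asarray(nc_se)
    def nll(params):
        mu, log_sigma = params
        total_sd = np.sqrt(np.exp(log_sigma) ** 2 + nc_se ** 2)
        return -norm.logpdf(nc_log_rr, loc=mu, scale=total_sd).sum()
    fit = minimize(nll, x0=np.array([0.0, 0.0]), method="Nelder-Mead")
    return fit.x[0], float(np.exp(fit.x[1]))     # (mu, sigma)

def calibrated_p_value(log_rr, se, mu, sigma):
    """Two-sided p-value of a new estimate under the empirical null."""
    z = (log_rr - mu) / np.sqrt(sigma ** 2 + se ** 2)
    return 2 * norm.sf(abs(z))
```

Calibrating confidence intervals follows the same idea but additionally uses the positive controls, since it requires modeling how systematic error varies with the true effect size; the open source tools mentioned above implement both procedures.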

Highlights

  • Observational healthcare data, such as administrative claims and electronic health records, offer opportunities to generate real-world evidence about the effects of treatments that can meaningfully improve the lives of patients

  • We present the "Observational Health Data Sciences and Informatics (OHDSI) Methods Benchmark", which aims to evaluate the performance of effect estimation methods on real data

  • We execute all 28 design variants of the five estimation methods on all 800 controls against the four databases, both with and without empirical calibration, producing a total of 179,200 effect size estimates (see the arithmetic check after this list). From these we derive a large set of performance metrics, which vary depending on which controls and data are included in the evaluation

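The total of 179,200 estimates quoted in the last highlight is simply the product of the experiment's dimensions, as this one-line check illustrates:

```python
# 28 design variants x 800 controls x 4 databases x 2 (uncalibrated, calibrated)
assert 28 * 800 * 4 * 2 == 179_200
```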

Introduction

Observational healthcare data, such as administrative claims and electronic health records, offer opportunities to generate real-world evidence about the effects of treatments that can meaningfully improve the lives of patients. Even though healthcare researchers have had access to large-scale observational databases for at least two decades, the literature still abounds with divergent opinions about the value of observational studies. In the face of conflicting evidence, decision-makers must make a subjective determination of which study results to trust; many choose to dismiss observational evidence completely. Little empirical evidence exists to guide decisions about when and how to use observational studies. If the field of observational research is to mature from an artisanal pursuit, devoid of established performance characteristics, into a true data science, further methodological work is required to quantify the reliability of the evidence it generates.
