Bias in trials comparing paired continuous tests can cause researchers to choose the wrong screening modality

Deborah H Glueck,Brandy M Ringham,John T Brinton,John M Lewin,Keith E Muller,Colin I O'Donnell,Todd A Alonzo,Molly M Lamb,Etta D Pisano

doi:10.1186/1471-2288-9-4

Abstract

BackgroundTo compare the diagnostic accuracy of two continuous screening tests, a common approach is to test the difference between the areas under the receiver operating characteristic (ROC) curves. After study participants are screened with both screening tests, the disease status is determined as accurately as possible, either by an invasive, sensitive and specific secondary test, or by a less invasive, but less sensitive approach. For most participants, disease status is approximated through the less sensitive approach. The invasive test must be limited to the fraction of the participants whose results on either or both screening tests exceed a threshold of suspicion, or who develop signs and symptoms of the disease after the initial screening tests.The limitations of this study design lead to a bias in the ROC curves we call paired screening trial bias. This bias reflects the synergistic effects of inappropriate reference standard bias, differential verification bias, and partial verification bias. The absence of a gold reference standard leads to inappropriate reference standard bias. When different reference standards are used to ascertain disease status, it creates differential verification bias. When only suspicious screening test scores trigger a sensitive and specific secondary test, the result is a form of partial verification bias.MethodsFor paired screening tests with bivariate normally distributed scores, we give formulae and programs to quantify the effect of paired screening trial bias on a paired comparison of area under the curves. We fix the prevalence of disease, and the chance a diseased subject manifests signs and symptoms. We derive the formulas for true sensitivity and specificity, and those for the sensitivity and specificity observed by the study investigator.ResultsThe observed area under the ROC curves is quite different from the true area under the ROC curves. The typical direction of the bias is a strong inflation in sensitivity, paired with a concomitant slight deflation of specificity.ConclusionIn paired trials of screening tests, when area under the ROC curve is used as the metric, bias may lead researchers to make the wrong decision as to which screening test is better.

Highlights

To compare the diagnostic accuracy of two continuous screening tests, a common approach is to test the difference between the areas under the receiver operating characteristic (ROC) curves
In paired trials of screening tests, when area under the ROC curve is used as the metric, bias may lead researchers to make the wrong decision as to which screening test is better
Paired trials designed to compare the diagnostic accuracy of screening tests using area under the receiver operating characteristic (ROC) curve may fall victim to a strong bias that renders the conclusions of the trial incorrect

Summary

Introduction

To compare the diagnostic accuracy of two continuous screening tests, a common approach is to test the difference between the areas under the receiver operating characteristic (ROC) curves. The limitations of this study design lead to a bias in the ROC curves we call paired screening trial bias. This bias reflects the synergistic effects of inappropriate reference standard bias, differential verification bias, and partial verification bias. Paired trials designed to compare the diagnostic accuracy of screening tests using area under the receiver operating characteristic (ROC) curve may fall victim to a strong bias that renders the conclusions of the trial incorrect. We consider area under the ROC curve, because it continues to be used as the standard in prominent medical journals [1,2,3]

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Medical Research Methodology	Publication Date: Jan 20, 2009
Citations: 16	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Bias in trials comparing paired continuous tests can cause researchers to choose the wrong screening modality

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Medical Research Methodology

Lead the way for us

Similar Papers

The Conduct and Reporting of Meta-Analyses of Studies of Diagnostic Tests, and a Consideration of ROC Curves: Answers to the January 2010 Journal Club Questions
Teri A Reynolds ... David L Schriger
Annals of Emergency Medicine | VOL. 55
Teri A Reynolds, et. al.Teri A Reynolds ... David L Schriger
21 May 2010
Annals of Emergency Medicine | VOL. 55

Clinical Practice Guidelines for Prenatal Aneuploidy Screening and Diagnostic Testing from Korean Society of Maternal-Fetal Medicine: (2) Invasive Diagnostic Testing for Fetal Chromosomal Abnormalities.
Ji Yeon Lee ... Minhyoung Kim
Journal of Korean medical science | VOL. 36
Ji Yeon Lee, et. al.Ji Yeon Lee ... Minhyoung Kim
01 Jan 2020
Journal of Korean medical science | VOL. 36

Adjusting for Partial Verification or Workup Bias in Meta-Analyses of Diagnostic Accuracy Studies
Joris A H De Groot ... Johannes B Reitsma
American Journal of Epidemiology | VOL. 175
Joris A H De Groot, et. al.Joris A H De Groot ... Johannes B Reitsma
15 Mar 2012
American Journal of Epidemiology | VOL. 175

Evidence based laboratory medicine for beginners: diagnostic accuracy studies and sources of bias
Chris Florkowski
Pathology | VOL. 45
Chris FlorkowskiChris Florkowski
01 Jan 2013
Pathology | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bias in trials comparing paired continuous tests can cause researchers to choose the wrong screening modality

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Medical Research Methodology