Abstract

Background: In January 2003, the STAndards for the Reporting of Diagnostic accuracy studies (STARD) were published in a number of journals to improve the quality of reporting in diagnostic accuracy studies. We designed a study to investigate the inter-assessment reproducibility, and the intra- and inter-observer reproducibility, of the items in the STARD statement.

Methods: Thirty-two diagnostic accuracy studies published in 2000 in medical journals with an impact factor of at least 4 were included. Two reviewers independently evaluated the quality of reporting of these studies using the 25 items of the STARD statement. A consensus evaluation was obtained by discussing and resolving disagreements between the reviewers. Almost two years later, the same studies were evaluated by the same reviewers. For each item, the percentage agreement and Cohen's kappa between the first and second consensus assessments (inter-assessment) were calculated. An intraclass correlation coefficient (ICC) was calculated to evaluate the reliability of the overall assessment.

Results: The overall inter-assessment agreement for all items of the STARD statement was 85% (Cohen's kappa 0.70) and varied from 63% to 100% for individual items. The largest differences between the two assessments were found for the reporting of the rationale for the reference standard (kappa 0.37), the number of included participants that underwent tests (kappa 0.28), the distribution of the severity of disease (kappa 0.23), a cross-tabulation of the results of the index test by the results of the reference standard (kappa 0.33), and how indeterminate results, missing data and outliers were handled (kappa 0.25). Large differences for these items were also observed within and between reviewers. The inter-assessment reliability of the STARD checklist was satisfactory (ICC = 0.79 [95% CI: 0.62 to 0.89]).

Conclusion: Although the overall reproducibility of assessing the quality of reporting of diagnostic accuracy studies with the STARD statement was found to be good, substantial disagreements were found for specific items. These disagreements were caused not so much by differences in the reviewers' interpretation of the items as by difficulties in assessing the reporting of these items due to a lack of clarity within the articles. Including a flow diagram in all reports of diagnostic accuracy studies would be very helpful in reducing confusion among readers and reviewers.
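As an illustration of the agreement statistics named above (percentage agreement and Cohen's kappa), the following minimal Python sketch shows how such values could be computed for item-level codings; the arrays `first` and `second` and their values are hypothetical placeholders, not data or code from the study:

    import numpy as np
    from sklearn.metrics import cohen_kappa_score

    # Hypothetical 0/1 codings: 1 = STARD item judged reported, 0 = not reported,
    # one entry per (article, item) pair at the first and second consensus assessment.
    first  = np.array([1, 1, 0, 1, 0, 1, 1, 0, 1, 1])
    second = np.array([1, 1, 0, 0, 0, 1, 1, 1, 1, 1])

    agreement = 100 * np.mean(first == second)    # percentage agreement
    kappa = cohen_kappa_score(first, second)      # chance-corrected agreement
    print(f"agreement = {agreement:.0f}%, Cohen's kappa = {kappa:.2f}")

The inter-assessment reliability reported as an ICC could be computed analogously, for example with pingouin.intraclass_corr applied to the per-article STARD totals from the two assessments; again, this is only a sketch of the statistics mentioned in the abstract, not the authors' actual analysis.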

Highlights

  • Reporting guidelines have been developed to improve the reporting of randomised controlled trials (CONSORT), diagnostic accuracy studies (STARD), systematic reviews of randomised controlled trials (QUOROM) and observational studies (MOOSE) [1,3,4,5,6,7].

  • Inter-assessment results for example STARD items (number (%) of the 32 articles in which the item was judged to be reported at the first and the second consensus assessment, and kappa):
    - Research questions or study aims, such as estimating diagnostic accuracy or comparing accuracy between tests or across participant groups: 27 (84); 31 (97); 0.30
    - The study population: the inclusion and exclusion criteria, setting and locations where data were collected: 17 (53); 10 (31); 0.57
    - Participant recruitment: whether recruitment was based on, among other things, results from previous tests or the fact that the participants had received the index tests or the reference standard: NA
    - Participant sampling: whether the study population was a consecutive series of participants defined by the selection criteria in items 3 and 4 and, if not, how participants were further selected: 20 (63); 25 (78); 0.64

  • After the publication of the CONSORT statement in 1996, Moher et al. evaluated the quality of reporting in 211 randomised controlled trials published in the British Medical Journal, the Journal of the American Medical Association, the Lancet, and the New England Journal of Medicine, using the CONSORT checklist.


Summary

Introduction

Guidelines have been developed to remedy shortcomings in the quality of reporting of clinical research: CONSORT for randomised controlled trials, STARD for diagnostic accuracy studies, QUOROM for systematic reviews of randomised controlled trials, and MOOSE for observational studies [1,3,4,5,6,7]. We have evaluated the quality of reporting of 124 diagnostic accuracy studies published in 2000 (PreSTARD evaluation) in 12 medical journals, using the items of the STARD statement. To evaluate whether the quality of reporting of diagnostic accuracy studies improved after publication of the STARD statement, knowledge of the reproducibility of assessments with the STARD checklist is needed. Our objective was to investigate the inter-assessment reproducibility of evaluating the quality of reporting of diagnostic accuracy studies published in 2000, using the items of the STARD statement. The intra- and inter-observer reproducibility was calculated to gain more insight into the sources of variation.


