Modelling Diagnostic Validity Estimates from Administrative Health Data

Kristine Kroeker, Lisa M Lix, Saman Muthukumarana, Depeng Jiang

doi:10.23889/ijpds.v1i1.171


Open Access


Abstract

Objectives: Validation studies compare diagnostic information in linked administrative and reference (i.e., gold standard) data. They are an essential tool for developing accurate case definitions, the rules used to identify individuals with a specific health condition in administrative data. Validation studies often estimate the accuracy of multiple case definitions in order to identify the data features (e.g., diagnosis codes, type of data source) that influence accuracy estimates. Descriptive analyses are commonly used to select the case definition(s) with the greatest accuracy estimates, but they fail to account for uncertainty in those estimates. The objectives were to: (1) compare the performance of regression-based approaches for testing differences in diagnostic accuracy estimates, and (2) demonstrate how to apply and use these models.

Approach: Computer simulation was used to compare three regression approaches: (a) univariate fixed-effects models applied separately to estimates of sensitivity and specificity; (b) a univariate fixed-effects model for Youden's index, the sum of sensitivity and specificity minus one; and (c) a bivariate random-effects joint model of sensitivity and specificity. The simulations varied the means and variances of sensitivity and specificity, the correlation between these parameters, and the number of case definitions. Performance was compared using: (a) bias, the difference between the estimated mean and the true (population) mean; (b) mean squared error (MSE), the sum of the variance of the estimate and the squared bias; and (c) 95% confidence interval (CI) coverage, the proportion of times the population mean is contained in the 95% CI. For objective 2, we applied the models to estimates of diagnostic accuracy from a published rheumatoid arthritis (RA) validation study with 61 case definitions.

Results: Univariate models of sensitivity and specificity had lower bias than the bivariate model (e.g., univariate=1.8%, bivariate=2.2%). The bivariate model had a smaller MSE than the univariate models when sample size was large and the correlation between sensitivity and specificity was small (e.g., univariate=3.4%, bivariate=2.6%). Across all scenarios, the univariate model for Youden's index showed small bias (average=2.4%) and small MSE (average=2.1%). For objective 2, the univariate models of sensitivity, specificity, and Youden's index revealed multiple case definition features that were associated with estimates of RA diagnostic accuracy: at least one diagnosis in hospital records, more than one diagnosis in physician claims, and at least one diagnosis by a specialist physician.

Conclusions: We recommend the bivariate model when a validation study contains a large number of case definitions. When the data contain a small number of case definitions, univariate models are recommended.
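The performance measures named in the abstract can be illustrated with a small sketch. It is not the authors' simulation design: the function names (`youden_index`, `performance`, `simulate`), the normal distribution for sensitivity, and all parameter values are assumptions chosen for illustration only. Youden's index is computed with its standard definition, sensitivity + specificity - 1, and the MSE uses the decomposition into variance plus squared bias.

```python
import math
import random
import statistics


def youden_index(sensitivity, specificity):
    """Standard Youden's index: J = sensitivity + specificity - 1."""
    return sensitivity + specificity - 1.0


def performance(estimates, ci_bounds, true_mean):
    """Summarise simulation replicates of an estimator of true_mean.

    estimates: one point estimate per replicate
    ci_bounds: one (lower, upper) 95% CI per replicate
    """
    bias = statistics.fmean(estimates) - true_mean
    variance = statistics.pvariance(estimates)
    mse = variance + bias ** 2  # variance plus squared bias
    covered = sum(lo <= true_mean <= hi for lo, hi in ci_bounds)
    return {"bias": bias, "mse": mse, "coverage": covered / len(ci_bounds)}


def simulate(true_sens=0.80, sd=0.05, n_definitions=20, n_reps=2000, seed=1):
    """Hypothetical scenario (an assumption, not the paper's design):
    each replicate draws the sensitivity of n_definitions case definitions
    from Normal(true_sens, sd) and estimates the mean with a Wald 95% CI."""
    rng = random.Random(seed)
    estimates, bounds = [], []
    for _ in range(n_reps):
        draws = [rng.gauss(true_sens, sd) for _ in range(n_definitions)]
        mean = statistics.fmean(draws)
        se = statistics.stdev(draws) / math.sqrt(n_definitions)
        estimates.append(mean)
        bounds.append((mean - 1.96 * se, mean + 1.96 * se))
    return performance(estimates, bounds, true_sens)


if __name__ == "__main__":
    print(youden_index(0.90, 0.80))
    print(simulate())
```

Under these assumptions the sample mean is unbiased, so the reported bias should be close to zero and the coverage close to, though typically slightly below, the nominal 95% because the interval uses a normal quantile with a small number of case definitions.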



Journal: International Journal of Population Data Science
Publication Date: Apr 18, 2017
License type: CC BY-NC-ND 4.0

