Abstract

Reference intervals are essential for the interpretation of laboratory test results in medicine. We propose a novel indirect approach to estimate reference intervals from real-world data as an alternative to direct methods, which require samples from healthy individuals. The presented refineR algorithm separates the non-pathological distribution from the pathological distribution of observed test results using an inverse approach and identifies the model that best explains the non-pathological distribution. To evaluate its performance, we simulated test results from six common laboratory analytes with a varying location and fraction of pathological test results. Estimated reference intervals were compared to the ground truth, an alternative indirect method (kosmic), and the direct method (N = 120 and N = 400 samples). Overall, refineR achieved the lowest mean percentage error of all methods (2.77%). Analyzing the amount of reference intervals within ± 1 total error deviation from the ground truth, refineR (82.5%) was inferior to the direct method with N = 400 samples (90.1%), but outperformed kosmic (70.8%) and the direct method with N = 120 (67.4%). Additionally, reference intervals estimated from pediatric data were comparable to published direct method studies. In conclusion, the refineR algorithm enables precise estimation of reference intervals from real-world data and represents a viable complement to the direct method.

Highlights

  • Reference intervals are essential for the interpretation of laboratory test results in medicine

  • We provide a high-performance and robust open-source implementation of an indirect method to accurately estimate reference intervals using real-world data (RWD), which is available as an R-package on CRAN

  • Shows the distribution of the estimated reference intervals in presence of abnormally low and high values in the dataset. (a–d) The rows represent the pathological distribution added on the left side, and the columns the pathological distribution added on the right side of the non-pathological one

Read more

Summary

Introduction

Reference intervals are essential for the interpretation of laboratory test results in medicine. We propose a novel indirect approach to estimate reference intervals from real-world data as an alternative to direct methods, which require samples from healthy individuals. Several indirect methods have already been implemented, e.g. the Hoffmann ­approach[15] and the Bhattacharya ­method[16] Both are limited to a Gaussian distribution of non-pathological results, which is not applicable for most cases. The TMC (Truncated Minimum chi-square) operates on interval data and minimizes the chi-square (χ2) distance between the estimated and the observed counts within a truncation ­interval[21] Both methods are implemented using Microsoft Excel and the R software environment, which leads to a sub-optimal usability in practice. For some datasets with unfavorable characteristics, computation time can be negatively ­impacted[22]

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.