Abstract

A tolerance interval is a statistical interval that covers at least 100ρ% of the population of interest with a 100(1−α)% confidence, where ρ and α are pre-specified values in (0, 1). In many scientific fields, such as pharmaceutical sciences, manufacturing processes, clinical sciences, and environmental sciences, tolerance intervals are used for statistical inference and quality control. Despite the usefulness of tolerance intervals, the procedures to compute tolerance intervals are not commonly implemented in statistical software packages. This paper aims to provide a comparative study of the computational procedures for tolerance intervals in some commonly used statistical software packages including JMP, Minitab, NCSS, Python, R, and SAS. On the other hand, we also investigate the effect of misspecifying the underlying probability model on the performance of tolerance intervals. We study the performance of tolerance intervals when the assumed distribution is the same as the true underlying distribution and when the assumed distribution is different from the true distribution via a Monte Carlo simulation study. We also propose a robust model selection approach to obtain tolerance intervals that are relatively insensitive to the model misspecification. We show that the proposed robust model selection approach performs well when the underlying distribution is unknown but candidate distributions are available.

Highlights

  • There are three types of statistical intervals commonly used in practice: confidence interval, prediction interval, and tolerance interval

  • 5.1 Model selection based on maximum likelihood we propose a simple model selection approach based on the maximum likelihood for the construction of tolerance intervals under model uncertainty in order

  • 7 Concluding remarks In this paper, we discuss the computation of tolerance intervals available in commonly used statistical software packages including JMP, Minitab, NCSS, Python, R, and SAS

Read more

Summary

Introduction

There are three types of statistical intervals commonly used in practice: confidence interval, prediction interval, and tolerance interval. A tolerance interval covers at least a specified proportion, ρ (0 ≤ ρ ≤ 1), of the population with a specified degree of confidence, 100(1 − α)% with 0 ≤ α ≤ 1 (Hahn and Meeker 1991). Suppose the lower tolerance bound based on a normal distribution is 1085.947, so the engineer can claim that at least 99% of all the light bulbs exceed approximately 1086 hours of burn time with 95% confidence (Minitab 18 Statistical Software 2017). Despite the usefulness of tolerance intervals, the computation of tolerance intervals based on different distributional assumptions is not commonly implemented in statistical software packages.

Basics of tolerance intervals
Statistical software packages for tolerance intervals
Effect of model misspecification on tolerance intervals
Findings
Proposed model selection approach
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call