This study analyzes the performance of control charts using statistics to monitor the stability of the mean in paired data, in the presence of missing or censored data. To determine the control limits, extensive simulations of paired data with a normal distribution are conducted to establish an empirical distribution under different correlation scenarios, sample sizes, and percentages of censored or missing data. After calculating the control limits, additional simulations are carried out to assess the out of control average run length to detect changes in the mean. We employ the paired t-test and the paired Wilcoxon test when the population is normal as they obtain similar results, however if they differ significantly, thus it is an indication of a potential violation of the hypothesis of normality in the data. Additionally, we evaluated some proposals to minimize the negative effect of data displaying censoring and missing values. We found that the results using parametric and non-parametric approaches are practically similar when the data is complete; the non-parametric approach being more affected by missing and censored data. A practical example monitoring the most-streamed songs in the US illustrates the proposed procedure.
Read full abstract