Expected Sample Size Research Articles

Multi-arm, multi-stage trials frequently include a standard care to which all interventions are compared. This may increase costs and hinders comparisons among the experimental arms. Furthermore, the standard care may not be evident, particularly when there is a large variation in standard practice. Thus, we aimed to develop an adaptive clinical trial that drops ineffective interventions following an interim analysis before selecting the best intervention at the final stage without requiring a standard care. We used Bayesian methods to develop a multi-arm, two-stage adaptive trial and evaluated two different methods for ranking interventions, the probability that each intervention was optimal (Pbest) and using the surface under the cumulative ranking curve (SUCRA), at both the interim and final analysis. The proposed trial design determines the maximum sample size for each intervention using the Average Length Criteria. The interim analysis takes place at approximately half the pre-specified maximum sample size and aims to drop interventions for futility if either Pbest or the SUCRA is below a pre-specified threshold. The final analysis compares all remaining interventions at the maximum sample size to conclude superiority based on either Pbest or the SUCRA. The two ranking methods were compared across 12 scenarios that vary the number of interventions and the assumed differences between the interventions. The thresholds for futility and superiority were chosen to control type 1 error, and then the predictive power and expected sample size were evaluated across scenarios. A trial comparing three interventions that aim to reduce anxiety for children undergoing a laceration repair in the emergency department was then designed, known as the Anxiolysis for Laceration Repair in Children Trial (ALICE) trial. As the number of interventions increases, the SUCRA results in a higher predictive power compared with Pbest. Using Pbest results in a lower expected sample size when there is an effective intervention. Using the Average Length Criterion, the ALICE trial has a maximum sample size for each arm of 100 patients. This sample size results in a 86% and 85% predictive power using Pbest and the SUCRA, respectively. Thus, we chose Pbest as the ranking method for the ALICE trial. Bayesian ranking methods can be used in multi-arm, multi-stage trials with no clear control intervention. When more interventions are included, the SUCRA results in a higher power than Pbest. Future work should consider whether other ranking methods may also be relevant for clinical trial design.

Read full abstract

BackgroundGroup sequential designs incorporating the option to stop for futility at the time point of an interim analysis can save time and resources. Thereby, the choice of the futility boundary importantly impacts the design’s resulting performance characteristics, including the power and probability to correctly or wrongly stop for futility. Several authors contributed to the topic of selecting good futility boundaries. For binary endpoints, Simon’s designs (Control Clin Trials 10:1–10, 1989) are commonly used two-stage designs for single-arm phase II studies incorporating futility stopping. However, Simon’s optimal design frequently yields an undesirably high probability of falsely declaring futility after the first stage, and in Simon’s minimax design often a high proportion of the planned sample size is already evaluated at the interim analysis leaving only limited benefit in case of an early stop.MethodsThis work focuses on the optimality criteria introduced by Schüler et al. (BMC Med Res Methodol 17:119, 2017) and extends their approach to binary endpoints in single-arm phase II studies. An algorithm for deriving optimized futility boundaries is introduced, and the performance of study designs implementing this concept of optimal futility boundaries is compared to the common Simon’s minimax and optimal designs, as well as modified versions of these designs by Kim et al. (Oncotarget 10:4255–61, 2019).ResultsThe introduced optimized futility boundaries aim to maximize the probability of correctly stopping for futility in case of small or opposite effects while also setting constraints on the time point of the interim analysis, the power loss, and the probability of stopping the study wrongly, i.e. stopping the study even though the treatment effect shows promise. Overall, the operating characteristics, such as maximum sample size and expected sample size, are comparable to those of the classical and modified Simon’s designs and sometimes better. Unlike Simon’s designs, which have binding stopping rules, the optimized futility boundaries proposed here are not adjusted to exhaust the full targeted nominal significance level and are thus still valid for non-binding applications.ConclusionsThe choice of the futility boundary and the time point of the interim analysis have a major impact on the properties of the study design. Therefore, they should be thoroughly investigated at the planning stage. The introduced method of selecting optimal futility boundaries provides a more flexible alternative to Simon’s designs with non-binding stopping rules. The probability of wrongly stopping for futility is minimized and the optimized futility boundaries don’t exhibit the unfavorable properties of an undesirably high probability of falsely declaring futility or a high proportion of the planned sample evaluated at the interim time point.

Read full abstract

Expected Sample Size Research Articles

Related Topics

Articles published on Expected Sample Size

An automated platform trial framework for A/B testing

Sample size recalculation in three-stage clinical trials and its evaluation

Pengaruh Gaya Kepemimpinan terhadap Kinerja Karyawan melalui Disiplin Kerja pada PT. Padma Soode Indonesia

Cross-validated risk scores adaptive enrichment (CADEN) design

A multi-arm multi-stage platform design that allows preplanned addition of arms while still controlling the family-wise error.

Using an early outcome as the sole source of information of interim decisions regarding treatment effect on a long-term endpoint: The non-Gaussian case.

177Lu-edotreotide versus everolimus in patients with advanced neuroendocrine tumors of lung or thymic origin: The phase 3 randomized LEVEL, GETNE-T2217 trial.

Study on effect of anti-bribery management system to the quality of authority organization operational management in construction industry

A comparison of alternative ranking methods in two-stage clinical trials with multiple interventions: An application to the anxiolysis for laceration repair in children trial.

Empirical meropenem versus piperacillin/tazobactam for adult patients with sepsis (EMPRESS) trial: Protocol.

Optimal double acceptance sampling plans based on truncated life tests for Tsallis q-exponential distributions

Upstrapping to determine futility: predicting future outcomes nonparametrically from past data

The Bayesian Group-Sequential Predictive Evidence Value Design for Phase II Clinical Trials with Binary Endpoints

Optimal futility stopping boundaries for binary endpoints

The use of real-world data for clinical investigation of effectiveness in drug development

Team Communication and Employee Performance in Commercial Banks in Tanzania

Optimal timing for an accelerated interim futility analysis incorporating real world data

Optimal estimation of reliability parameter for inverse Pareto distribution with application to insurance data

Outcome of Olecranon Fracture Treated with Tension Band Wiring Versus Anatomical Locking Plate Fixation

Covariate adjustment in Bayesian adaptive randomized controlled trials.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Expected Sample Size Research Articles

Related Topics

Articles published on Expected Sample Size

An automated platform trial framework for A/B testing

Sample size recalculation in three-stage clinical trials and its evaluation

Pengaruh Gaya Kepemimpinan terhadap Kinerja Karyawan melalui Disiplin Kerja pada PT. Padma Soode Indonesia

Cross-validated risk scores adaptive enrichment (CADEN) design

A multi-arm multi-stage platform design that allows preplanned addition of arms while still controlling the family-wise error.

Using an early outcome as the sole source of information of interim decisions regarding treatment effect on a long-term endpoint: The non-Gaussian case.

177Lu-edotreotide versus everolimus in patients with advanced neuroendocrine tumors of lung or thymic origin: The phase 3 randomized LEVEL, GETNE-T2217 trial.

Study on effect of anti-bribery management system to the quality of authority organization operational management in construction industry

A comparison of alternative ranking methods in two-stage clinical trials with multiple interventions: An application to the anxiolysis for laceration repair in children trial.

Empirical meropenem versus piperacillin/tazobactam for adult patients with sepsis (EMPRESS) trial: Protocol.

Optimal double acceptance sampling plans based on truncated life tests for Tsallis q-exponential distributions

Upstrapping to determine futility: predicting future outcomes nonparametrically from past data

The Bayesian Group-Sequential Predictive Evidence Value Design for Phase II Clinical Trials with Binary Endpoints

Optimal futility stopping boundaries for binary endpoints

The use of real-world data for clinical investigation of effectiveness in drug development

Team Communication and Employee Performance in Commercial Banks in Tanzania

Optimal timing for an accelerated interim futility analysis incorporating real world data

Optimal estimation of reliability parameter for inverse Pareto distribution with application to insurance data

Outcome of Olecranon Fracture Treated with Tension Band Wiring Versus Anatomical Locking Plate Fixation

Covariate adjustment in Bayesian adaptive randomized controlled trials.