Informative Prior Distributions Research Articles

The coronavirus disease 2019 pandemic highlighted the need to conduct efficient randomized clinical trials with interim monitoring guidelines for efficacy and futility. Several randomized coronavirus disease 2019 trials, including the Multiplatform Randomized Clinical Trial (mpRCT), used Bayesian guidelines with the belief that they would lead to quicker efficacy or futility decisions than traditional "frequentist" guidelines, such as spending functions and conditional power. We explore this belief using an intuitive interpretation of Bayesian methods as translating prior opinion about the treatment effect into imaginary prior data. These imaginary observations are then combined with actual observations from the trial to make conclusions. Using this approach, we show that the Bayesian efficacy boundary used in mpRCT is actually quite similar to the frequentist Pocock boundary. The mpRCT's efficacy monitoring guideline considered stopping if, given the observed data, there was greater than 99% probability that the treatment was effective (odds ratio greater than 1). The mpRCT's futility monitoring guideline considered stopping if, given the observed data, there was greater than 95% probability that the treatment was less than 20% effective (odds ratio less than 1.2). The mpRCT used a normal prior distribution that can be thought of as supplementing the actual patients' data with imaginary patients' data. We explore the effects of varying probability thresholds and the prior-to-actual patient ratio in the mpRCT and compare the resulting Bayesian efficacy monitoring guidelines to the well-known frequentist Pocock and O'Brien-Fleming efficacy guidelines. We also contrast Bayesian futility guidelines with a more traditional 20% conditional power futility guideline. A Bayesian efficacy and futility monitoring boundary using a neutral, weakly informative prior distribution and a fixed probability threshold at all interim analyses is more aggressive than the commonly used O'Brien-Fleming efficacy boundary coupled with a 20% conditional power threshold for futility. The trade-off is that more aggressive boundaries tend to stop trials earlier, but incur a loss of power. Interestingly, the Bayesian efficacy boundary with 99% probability threshold is very similar to the classic Pocock efficacy boundary. In a pandemic where quickly weeding out ineffective treatments and identifying effective treatments is paramount, aggressive monitoring may be preferred to conservative approaches, such as the O'Brien-Fleming boundary. This can be accomplished with either Bayesian or frequentist methods.

Read full abstract

Two CO2 storage sites located in the western Norwegian North Sea (NNS), called Aurora and Smeaheia, are currently under construction and assessment respectively. In geological storage of CO2, the in situ minimum horizontal stress is an essential input parameter for assessment of both containment and induced seismic risks [1]. To infer the stress states at certain depths at a site where no data is available, the standard approach is to perform a classical linear regression on stress data versus depth and treat the fitted trend line as the best site-specific stress predictions along depth [2]. However, stress data are often highly limited at CO2 storage sites; for example, Aurora and Smeaheia have only five in situ stress measurements available at best respectively. Such limited data may severely underrepresent the true stress distribution at one site. Data scarcity coupled with measurement error and spatial variability poses a challenge to reliable stress prediction, and hence it is crucial to quantify and reduce uncertainty in site-specific stress prediction for CO2 storage. Stress uncertainty is actually the required input information in the more rational probabilistic risk assessment framework. A natural solution to reducing uncertainty is to integrate stress information from other sources. On the Norwegian continental shelf, extensive data has been accumulated from previous petroleum projects. Of the publicly available NPD stress database, Figure 1a shows the distribution of versus depth (< 3,000 m) for each site within the study area containing Aurora and Smeaheia, and reveals a certain degree of similarity between the stress trends at the 11 sites. Such similarity, aligned with other published results [2], may be attributed to the relaxed sedimentary basins where gravitational loading dominates the lateral stress distribution rather than tectonic components, with the between-site variation arising from differences in the geological conditions and pore pressures [3]. When facing limited data for a site like Aurora and Smeaheia, the current approach is often to either directly use the stress trend from other sites having richer data or expand the coverage area to include more data. Such semi-subjective information borrowing approach, although effective in many cases, may lead to overly confident stress predictions as it fails to account for possible between-site heterogeneity in stress trends. Bayesian inference has been widely used as a rigorous and powerful statistical approach for quantifying uncertainty, as well as combining information from different sources via informative prior distributions. Hence, historical stress data may be integrated into Bayesian analysis of site-specific data in the form of prior distributions, with stress uncertainties being quantified and updated as the posterior distributions [4, 5]. When developing prior distributions for site-specific stress prediction, it may be tempting to combine all historical stress data for a holistic Bayesian analysis, yet such complete pooling approach may give an overconfident summary of prior information in that it ignores the possible stress heterogeneity between sites. This paper presents a Bayesian hierarchical (i.e., partial pooling) model (BHM) that explicitly accounts for between-site heterogeneity/similarity when constructing prior distributions from historical stress data, and demonstrates how the proposed model effectively borrows historical information to reduce uncertainty in site-specific stress prediction for CO2 storage in the NNS study area. Figures 1b illustrates the prior predictions of versus depth at the Aurora site from the Bayesian complete and partial pooling models. Although the complete pooling model gives less uncertain stress predictions than the partial pooling model as indicated by the narrower 90% prediction intervals (PIs), it does not well capture the five unseen stress measurements at Aurora in that two out of five stress values fall outside the 90% PIs. This suggests that complete pooling analysis indeed gives overconfident prior distributions out of the NPD database, and is thus not suitable for integrating historical data into site-specific stress prediction for CO2 storage in the NNS. On the other hand, the partial pooling model gives fairly good prior predictions of the five unseen stress values at Aurora, albeit with larger uncertainties. This result demonstrates the effectiveness of BHM as a framework for formulating proper informative priors from historical data, and an encouraging implication is that probabilistic risk assessment is allowed even with no site-specific stress data at this storage site, which is not possible if external information is not integrated properly. Figure 1c shows the posterior stress predictions updated with the five site-specific stress values from the two Bayesian models in question. After incorporating the site-specific data, the complete pooling model still over-predicts the two stress values at depths with barely noticeable updating, while partial pooling gives considerably more accurate stress predictions with reduced uncertainty.

Read full abstract

Informative Prior Distributions Research Articles

Articles published on Informative Prior Distributions

Kinetic profile inference with outlier detection using support vector machine regression and Gaussian process regression

An extensive re-evaluation of evidence and analyses of the Randomised Badger Culling Trial (RBCT) I: Within proactive culling areas.

Optimized multi-point hemispherical grid model with adaptive grid division based on the prior information of multipath error

Residual squeeze-and-excitation convolutional auto-encoder for fault detection and diagnosis in complex industrial processes

A Bayesian approach to data-driven multi-stage stochastic optimization

Underwater image enhancement based on global features and prior distribution guided

Beyond the Classical Type I Error: Bayesian Metrics for Bayesian Designs Using Informative Priors

Comparison of Bayesian and frequentist monitoring boundaries motivated by the Multiplatform Randomized Clinical Trial.

Assessing risk factors of bypass graft surgery through the implementation of Bayesian and non-Bayesian methodologies

Exploring transportation equity issues for persons with disabilities: The impact of gender on mobility and accessibility indicators

Meta-Analysis of Normal Reference Values for Right and Left Ventricular Quantification by Cardiovascular Magnetic Resonance.

Inhibition of Receptor-Interacting Protein Kinase1 in Chronic Plaque Psoriasis: A Multicenter, Randomized, Double-Blind, Placebo-Controlled Study.

Complemented subspace-based weighted collaborative representation model for imbalanced learning

MetaNorm: incorporating meta-analytic priors into normalization of NanoString nCounter data.

Bayesian random-effects meta-analysis with empirical heterogeneity priors for application in health technology assessment with very few studies.

Robust incorporation of historical information with known type I error rate inflation.

Time-dependent reliability assessment of existing concrete bridges with varying knowledge levels by proof load testing

Discriminative multimodal learning via conditional priors in generative models

Incorporation of healthy volunteers data on receptor occupancy into a phase II proof-of-concept trial using a Bayesian dynamic borrowing design.

Reducing Uncertainty in Site-specific Stress Prediction for CO2 Storage in the North Sea, Norway

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Informative Prior Distributions Research Articles

Articles published on Informative Prior Distributions

Kinetic profile inference with outlier detection using support vector machine regression and Gaussian process regression

An extensive re-evaluation of evidence and analyses of the Randomised Badger Culling Trial (RBCT) I: Within proactive culling areas.

Optimized multi-point hemispherical grid model with adaptive grid division based on the prior information of multipath error

Residual squeeze-and-excitation convolutional auto-encoder for fault detection and diagnosis in complex industrial processes

A Bayesian approach to data-driven multi-stage stochastic optimization

Underwater image enhancement based on global features and prior distribution guided

Beyond the Classical Type I Error: Bayesian Metrics for Bayesian Designs Using Informative Priors

Comparison of Bayesian and frequentist monitoring boundaries motivated by the Multiplatform Randomized Clinical Trial.

Assessing risk factors of bypass graft surgery through the implementation of Bayesian and non-Bayesian methodologies

Exploring transportation equity issues for persons with disabilities: The impact of gender on mobility and accessibility indicators

Meta-Analysis of Normal Reference Values for Right and Left Ventricular Quantification by Cardiovascular Magnetic Resonance.

Inhibition of Receptor-Interacting Protein Kinase1 in Chronic Plaque Psoriasis: A Multicenter, Randomized, Double-Blind, Placebo-Controlled Study.

Complemented subspace-based weighted collaborative representation model for imbalanced learning

MetaNorm: incorporating meta-analytic priors into normalization of NanoString nCounter data.

Bayesian random-effects meta-analysis with empirical heterogeneity priors for application in health technology assessment with very few studies.

Robust incorporation of historical information with known type I error rate inflation.

Time-dependent reliability assessment of existing concrete bridges with varying knowledge levels by proof load testing

Discriminative multimodal learning via conditional priors in generative models

Incorporation of healthy volunteers data on receptor occupancy into a phase II proof-of-concept trial using a Bayesian dynamic borrowing design.

Reducing Uncertainty in Site-specific Stress Prediction for CO2 Storage in the North Sea, Norway