After a machine learning model has been deployed into production, its predictive performance needs to be monitored. Ideally, such monitoring can be carried out by comparing the model’s predictions against ground truth labels. For this to be possible, the ground truth labels must be available relatively soon after inference. However, in many use cases ground truth labels become available only after a significant delay, or in the worst case, not at all. In such cases, directly monitoring the model’s predictive performance is impossible. Recently, novel methods have been developed for estimating the predictive performance of a model when ground truth is unavailable. Many of these methods leverage model confidence or other uncertainty estimates and are experimentally compared against a naive baseline method, namely Average Confidence (AC), which estimates model accuracy as the average of the confidence scores for a given set of predictions. However, the theoretical properties of the AC method have not been properly explored until now. In this paper, we bridge this gap by reviewing the AC method and showing that, under certain general assumptions, it is an unbiased and consistent estimator of model accuracy. We also augment the AC method by deriving valid confidence intervals for the estimates it produces. These contributions elevate AC from an ad-hoc estimator to a principled one, encouraging its use in practice. We complement our theoretical results with empirical experiments comparing AC against more complex estimators in a monitoring setting under covariate shift. We conduct our experiments on synthetic datasets, which allow for full control over the nature of the shift. Our experiments with binary classifiers show that the AC method outperforms the other estimators in many cases. However, the relative quality of the different estimators is found to be heavily case-dependent.
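As an illustration of the estimator described above, the following Python sketch computes the AC point estimate for a batch of confidence scores, together with a generic normal-approximation interval on the mean confidence. The interval construction and the synthetic scores here are assumptions for illustration only; the confidence intervals derived in the paper may be constructed differently.

    import numpy as np

    def average_confidence(confidences, z=1.96):
        """AC estimate of accuracy: the mean of the per-prediction confidence
        scores, with a normal-approximation interval on that mean (an
        illustrative stand-in, not necessarily the interval derived in the
        paper)."""
        conf = np.asarray(confidences, dtype=float)
        n = conf.size
        ac = conf.mean()                      # point estimate of accuracy
        se = conf.std(ddof=1) / np.sqrt(n)    # standard error of the mean
        return ac, (ac - z * se, ac + z * se)

    # Hypothetical confidence scores from a binary classifier on unlabeled data.
    rng = np.random.default_rng(0)
    scores = rng.uniform(0.5, 1.0, size=1000)
    estimate, (lo, hi) = average_confidence(scores)
    print(f"Estimated accuracy: {estimate:.3f}, 95% interval: [{lo:.3f}, {hi:.3f}]")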