Individualized conformal prediction: using synthetic data as relevant controls

  • Abstract
  • References
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

Individualized conformal prediction: using synthetic data as relevant controls

ReferencesShowing 10 of 21 papers
  • Open Access Icon
  • Cite Count Icon 3
  • 10.1080/01621459.2021.1947306
Individualized Group Learning
  • Aug 7, 2021
  • Journal of the American Statistical Association
  • Chencheng Cai + 2 more

  • Open Access Icon
  • Cite Count Icon 51
  • 10.1007/978-3-031-06649-8
Algorithmic Learning in a Random World
  • Jan 1, 2022
  • Vladimir Vovk + 2 more

  • Cite Count Icon 1
  • 10.1007/s12667-024-00684-6
Learning causality structures from electricity demand data
  • Jun 28, 2024
  • Energy Systems
  • Mariano Maisonnave + 4 more

  • 10.1145/3736575
Conformal Prediction: A Data Perspective
  • Sep 9, 2025
  • ACM Computing Surveys
  • Xiaofan Zhou + 3 more

  • Open Access Icon
  • Cite Count Icon 42
  • 10.1146/annurev-statistics-010814-020310
There Is Individualized Treatment. Why Not Individualized Inference?
  • Jun 1, 2016
  • Annual Review of Statistics and Its Application
  • Keli Liu + 1 more

  • Open Access Icon
  • Cite Count Icon 161
  • 10.1214/20-aos1965
Predictive inference with the jackknife+
  • Jan 29, 2021
  • The Annals of Statistics
  • Rina Foygel Barber + 3 more

  • Cite Count Icon 110
  • 10.1561/2200000101
Conformal Prediction: A Gentle Introduction
  • Jan 1, 2023
  • Foundations and Trends® in Machine Learning
  • Anastasios N Angelopoulos + 1 more

  • Open Access Icon
  • Cite Count Icon 8
  • 10.1002/9781118445112.stat08298
Divide‐and‐Conquer Methods for Big Data Analysis
  • Nov 23, 2021
  • Xueying Chen + 2 more

  • Open Access Icon
  • Cite Count Icon 492
  • 10.1080/01621459.2017.1307116
Distribution-Free Predictive Inference for Regression
  • Jun 8, 2018
  • Journal of the American Statistical Association
  • Jing Lei + 4 more

  • Open Access Icon
  • Cite Count Icon 21
  • 10.3390/cancers14163923
A Causal Framework for Making Individualized Treatment Decisions in Oncology.
  • Aug 14, 2022
  • Cancers
  • Pavlos Msaouel + 3 more

Similar Papers
  • Research Article
  • Cite Count Icon 2
  • 10.1016/j.eswa.2023.122322
Improving conformalized quantile regression through cluster-based feature relevance
  • Oct 31, 2023
  • Expert Systems With Applications
  • Martim Sousa + 2 more

Conformalized quantile regression, a cutting-edge and model-agnostic algorithm, has emerged as a recent innovation to generate valid prediction intervals on finite samples while addressing heteroscedasticity. It starts by employing quantile regression to estimate conditional quantiles. Subsequently, these estimated conditional quantiles undergo a rectification process using conformal prediction. Under the assumption of exchangeability, a slightly weaker form of independent and identically distributed (i.i.d.) data, the resulting prediction intervals are valid in finite samples. However, a drawback of the proposed conformalization step is identified: it lacks the capacity to adapt to heteroscedasticity due to its independence from the input. To overcome this limitation, we propose an improvement that involves partitioning the covariates space into clusters, assigning higher weights to features with greater predictive power. Following that, within each cluster, a conformal step is applied, leveraging a rectification that is reliant on the input cluster-wise.To demonstrate the superiority of our improved version over the classic version of conformalized quantile regression, we conducted a comprehensive comparison of their respective prediction intervals using synthetic data.

  • Research Article
  • Cite Count Icon 19
  • 10.1093/jrsssb/qkac004
Conformalized survival analysis
  • Jan 28, 2023
  • Journal of the Royal Statistical Society Series B: Statistical Methodology
  • Emmanuel Candès + 2 more

In this paper, we develop an inferential method based on conformal prediction, which can wrap around any survival prediction algorithm to produce calibrated, covariate-dependent lower predictive bounds on survival times. In the Type I right-censoring setting, when the censoring times are completely exogenous, the lower predictive bounds have guaranteed coverage in finite samples without any assumptions other than that of operating on independent and identically distributed data points. Under a more general conditionally independent censoring assumption, the bounds satisfy a doubly robust property which states the following: marginal coverage is approximately guaranteed if either the censoring mechanism or the conditional survival function is estimated well. The validity and efficiency of our procedure are demonstrated on synthetic data and real COVID-19 data from the UK Biobank.

  • Book Chapter
  • Cite Count Icon 4
  • 10.1007/978-3-030-63820-7_16
Trajectory Anomaly Detection Based on the Mean Distance Deviation
  • Jan 1, 2020
  • Xiaoyuan Hu + 2 more

With the development of science and technology and the explosive growth of data, there will be a lot of trajectories every day. However, how to detect the abnormal trajectory from many trajectories has become a hot issue. In order to study trajectory anomaly detection better, we analyze the Sequential conformal anomaly detection in trajectories based on hausdorff distance (SNN-CAD) method, and propose a new measurement method of trajectory distance Improved Moved Euclidean Distance (IMED) instead of Hausdorff distance, which reduces the computational complexity. In addition, we propose a removing-updating strategy to enhance the conformal prediction (CP). Then, we also put forward our Non-conformity measure (NCM), Mean Distance Deviation. It can enlarge the difference between trajectories more effectively, and detect the abnormal trajectory more accurately. Finally, based on the technical measures mentioned above and under the framework of enhanced conformal prediction theory detection, we also build our own detector called Mean Distance Deviation Detector (MDD-ECAD). Using a large number of synthetic trajectory data and real world trajectory data on two detectors, the experimental results show that MDD-ECAD is much better than SNN-CAD in both accuracy and running time.KeywordsConformal predictionNon-conformity measureTrajectory distance

  • Research Article
  • 10.62311/nesx/rp-30032023-29-40
Causal Time-Series with statsmodels: Conformal Risk Control under Regime Shifts
  • Mar 30, 2023
  • International Journal of Academic and Industrial Research Innovations(IJAIRI)
  • Murali Krishna Pasupuleti

Abstract: We develop a causal time-series framework using statsmodels to maintain calibrated predictive risk under regime shifts. Combining ARIMAX/VAR with drift detection and conformal risk control (CRC), we adapt the risk level α_t online to preserve target coverage while keeping intervals compact. On synthetic regime-switching data, CRC with change-point signals recovers nominal 90% coverage with 3–8% tighter intervals versus static conformal baselines and reduces false alerts. We present a conceptual model, mathematical formulation, and evaluation protocol aligned with regulatory expectations for model risk management. Keywords: Causal Time-Series; statsmodels; ARIMAX; VAR; Conformal Prediction; Risk Control; Change-Point Detection; Regime Shifts.

  • Research Article
  • 10.1145/3731681
PAGE: Domain-Incremental Adaptation with Past-Agnostic Generative Replay for Smart Healthcare
  • Apr 28, 2025
  • ACM Transactions on Computing for Healthcare
  • Chia-Hao Li + 1 more

Modern advances in machine learning (ML) and wearable medical sensors (WMSs) have enabled out-of-clinic disease detection. However, a trained ML model often suffers from misclassification when encountering non-stationary data domains after deployment. Use of continual learning (CL) strategies is a common way to perform domain-incremental adaptation while mitigating catastrophic forgetting. Nevertheless, most existing CL methods require access to previously learned domains through preservation of raw training data or distilled information. This is often infeasible in real-world scenarios due to storage limitations or data privacy, especially in smart healthcare applications. Moreover, it makes most existing CL algorithms inapplicable to deployed models in the field, thus incurring re-engineering costs. To address these challenges, we propose PAGE, a domain-incremental adaptation strategy with past-agnostic generative replay for smart healthcare. PAGE enables generative replay without the aid of any preserved data or information from prior domains. When adapting to a new domain, it exploits real data from the new distribution and the current model to generate synthetic data that retain the learned knowledge of previous domains. By replaying the synthetic data with the new real data during training, PAGE achieves a good balance between domain adaptation and knowledge retention. In addition, we incorporate an extended inductive conformal prediction (EICP) method into PAGE to produce a confidence score and a credibility value for each detection result. This makes the predictions interpretable and provides statistical guarantees for disease detection in smart healthcare applications. We demonstrate PAGE’s effectiveness in domain-incremental disease detection with three distinct disease datasets collected from commercially available WMSs. PAGE achieves highly competitive performance against state-of-the-art along with superior scalability, data privacy, and feasibility. Furthermore, PAGE is able to enable up to 75% reduction in clinical workload with the help of EICP.

  • Book Chapter
  • Cite Count Icon 4
  • 10.1007/978-3-030-67667-4_34
Prediction of Global Navigation Satellite System Positioning Errors with Guarantees
  • Jan 1, 2021
  • Alejandro Kuratomi + 2 more

Intelligent Transportation Systems employ different localization technologies, such as the Global Navigation Satellite System. This system transmits signals between satellite and receiver devices on the ground which can estimate their position on earth’s surface. The accuracy of this positioning estimate, or the positioning error estimation, is of utmost importance for the efficient and safe operation of autonomous vehicles, which require not only the position estimate, but also an estimation of their operation margin. This paper proposes a workflow for positioning error estimation using a random forest regressor along with a post-hoc conformal prediction framework. The latter is calibrated on the random forest out-of-bag samples to transform the obtained positioning error estimates into predicted integrity intervals, which are confidence intervals on the positioning error prediction with at least 99.999\(\%\) confidence. The performance is measured as the number of ground truth positioning errors inside the predicted integrity intervals. An extensive experimental evaluation is performed on real-world and synthetic data in terms of root mean square error between predicted and ground truth positioning errors. Our solution results in an improvement of 73\(\%\) compared to earlier research, while providing prediction statistical guarantees.

  • Research Article
  • Cite Count Icon 23
  • 10.1093/biomet/asz046
Fast exact conformalization of the lasso using piecewise linear homotopy
  • Sep 30, 2019
  • Biometrika
  • J Lei

SummaryConformal prediction is a general method that converts almost any point predictor to a prediction set. The resulting set retains the good statistical properties of the original estimator under standard assumptions, and guarantees valid average coverage even when the model is mis-specified. A main challenge in applying conformal prediction in modern applications is efficient computation, as it generally requires an exhaustive search over the entire output space. In this paper we develop an exact and computationally efficient conformalization of the lasso and elastic net. The method makes use of a novel piecewise linear homotopy of the lasso solution under perturbation of a single input sample point. As a by-product, we provide a simpler and better-justified online lasso algorithm, which may be of independent interest. Our derivation also reveals an interesting accuracy-stability trade-off in conformal inference, which is analogous to the bias-variance trade-off in traditional parameter estimation. The practical performance of the new algorithm is demonstrated in both synthetic and real data examples.

  • Research Article
  • Cite Count Icon 3
  • 10.1016/j.asoc.2020.106807
A conformal prediction inspired approach for distribution regression with random Fourier features
  • Oct 16, 2020
  • Applied Soft Computing
  • Di Wang + 4 more

A conformal prediction inspired approach for distribution regression with random Fourier features

  • Book Chapter
  • Cite Count Icon 1
  • 10.1007/978-3-642-33412-2_25
Identification of Confinement Regimes in Tokamak Plasmas by Conformal Prediction on a Probabilistic Manifold
  • Jan 1, 2012
  • Geert Verdoolaege + 3 more

Pattern recognition is becoming an increasingly important tool for making inferences from the massive amounts of data produced in magnetic confinement fusion experiments. However, the measurements obtained from the various plasma diagnostics are typically affected by a considerable statistical uncertainty. In this work, we consider the inherent stochastic nature of the data by modeling the measurements by probability distributions in a metric space. Information geometry permits the calculation of the geodesic distances on such manifolds, which we apply to the important problem of the classification of plasma confinement regimes. We use a distance-based conformal predictor, which we first apply to a synthetic data set. Next, the method yields an excellent classification performance with measurements from an international database. The conformal predictor also returns confidence and credibility measures, which are particularly important for interpretation of pattern recognition results in stochastic fusion data.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 10
  • 10.3390/s19010084
SHNN-CAD+: An Improvement on SHNN-CAD for Adaptive Online Trajectory Anomaly Detection
  • Dec 27, 2018
  • Sensors (Basel, Switzerland)
  • Yuejun Guo + 1 more

To perform anomaly detection for trajectory data, we study the Sequential Hausdorff Nearest-Neighbor Conformal Anomaly Detector (SHNN-CAD) approach, and propose an enhanced version called SHNN-CAD. SHNN-CAD was introduced based on the theory of conformal prediction dealing with the problem of online detection. Unlike most related approaches requiring several not intuitive parameters, SHNN-CAD has the advantage of being parameter-light which enables the easy reproduction of experiments. We propose to adaptively determine the anomaly threshold during the online detection procedure instead of predefining it without any prior knowledge, which makes the algorithm more usable in practical applications. We present a modified Hausdorff distance measure that takes into account the direction difference and also reduces the computational complexity. In addition, the anomaly detection is more flexible and accurate via a re-do strategy. Extensive experiments on both real-world and synthetic data show that SHNN-CAD outperforms SHNN-CAD with regard to accuracy and running time.

More from: Behaviormetrika
  • New
  • Research Article
  • 10.1007/s41237-025-00274-5
Parametric causal mediation analysis with asymmetric binary regression model
  • Nov 6, 2025
  • Behaviormetrika
  • Yuji Tsubota + 1 more

  • Research Article
  • 10.1007/s41237-025-00275-4
Reconciling functional data regression with excess bases
  • Sep 3, 2025
  • Behaviormetrika
  • Tomoya Wakayama + 1 more

  • Research Article
  • 10.1007/s41237-025-00270-9
Capsule network-driven feature extraction and ensemble learning for robust lung tumor classification
  • Aug 9, 2025
  • Behaviormetrika
  • V Nisha Jenipher + 1 more

  • Research Article
  • 10.1007/s41237-025-00269-2
Bridging the gap between machine learning and psychometrics
  • Aug 5, 2025
  • Behaviormetrika
  • Jonathan Templin

  • Research Article
  • 10.1007/s41237-025-00267-4
Introduction to the vol. 52, no. 2, 2025
  • Jul 18, 2025
  • Behaviormetrika
  • Maomi Ueno

  • Research Article
  • 10.1007/s41237-025-00268-3
Advances in multivariate data analysis
  • Jul 18, 2025
  • Behaviormetrika
  • Kohei Adachi + 1 more

  • Research Article
  • 10.1007/s41237-025-00266-5
Individualized conformal prediction: using synthetic data as relevant controls
  • Jul 15, 2025
  • Behaviormetrika
  • Fernando Delbianco + 1 more

  • Research Article
  • 10.1007/s41237-025-00265-6
Anti-ridge regression for communality estimation in factor analysis
  • Jul 5, 2025
  • Behaviormetrika
  • Haruhiko Ogasawara

  • Research Article
  • 10.1007/s41237-025-00260-x
Misspecifying non-compensatory as compensatory IRT: analysis of estimated skills and variance
  • Jul 5, 2025
  • Behaviormetrika
  • Hiroshi Tamano + 2 more

  • Research Article
  • 10.1007/s41237-025-00263-8
Assessing item-level fit for the sequential G-DINA model
  • Jun 14, 2025
  • Behaviormetrika
  • Pablo Nájera + 3 more

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon

AI summaries and top papers from 250M+ research sources.

Search IconWhat is the difference between bacteria and viruses?
Open In New Tab Icon
Search IconWhat is the function of the immune system?
Open In New Tab Icon
Search IconCan diabetes be passed down from one generation to the next?
Open In New Tab Icon