Suboptimal Predictions Research Articles

Machine learning offers new solutions for predicting life-threatening, unpredictable amiodarone-induced thyroid dysfunction. Traditional regression approaches for adverse-effect prediction without time-series consideration of features have yielded suboptimal predictions. Machine learning algorithms with multiple data sets at different time points may generate better performance in predicting adverse effects. We aimed to develop and validate machine learning models for forecasting individualized amiodarone-induced thyroid dysfunction risk and to optimize a machine learning-based risk stratification scheme with a resampling method and readjustment of the clinically derived decision thresholds. This study developed machine learning models using multicenter, delinked electronic health records. It included patients receiving amiodarone from January 2013 to December 2017. The training set was composed of data from Taipei Medical University Hospital and Wan Fang Hospital, while data from Taipei Medical University Shuang Ho Hospital were used as the external test set. The study collected stationary features at baseline and dynamic features at the first, second, third, sixth, ninth, 12th, 15th, 18th, and 21st months after amiodarone initiation. We used 16 machine learning models, including extreme gradient boosting, adaptive boosting, k-nearest neighbor, and logistic regression models, along with an original resampling method and 3 other resampling methods, including oversampling with the borderline-synthesized minority oversampling technique, undersampling-edited nearest neighbor, and over- and undersampling hybrid methods. The model performance was compared based on accuracy; Precision, recall, F1-score, geometric mean, area under the curve of the receiver operating characteristic curve (AUROC), and the area under the precision-recall curve (AUPRC). Feature importance was determined by the best model. The decision threshold was readjusted to identify the best cutoff value and a Kaplan-Meier survival analysis was performed. The training set contained 4075 patients from Taipei Medical University Hospital and Wan Fang Hospital, of whom 583 (14.3%) developed amiodarone-induced thyroid dysfunction, while the external test set included 2422 patients from Taipei Medical University Shuang Ho Hospital, of whom 275 (11.4%) developed amiodarone-induced thyroid dysfunction. The extreme gradient boosting oversampling machine learning model demonstrated the best predictive outcomes among all 16 models. The accuracy; Precision, recall, F1-score, G-mean, AUPRC, and AUROC were 0.923, 0.632, 0.756, 0.688, 0.845, 0.751, and 0.934, respectively. After readjusting the cutoff, the best value was 0.627, and the F1-score reached 0.699. The best threshold was able to classify 286 of 2422 patients (11.8%) as high-risk subjects, among which 275 were true-positive patients in the testing set. A shorter treatment duration; higher levels of thyroid-stimulating hormone and high-density lipoprotein cholesterol; and lower levels of free thyroxin, alkaline phosphatase, and low-density lipoprotein were the most important features. Machine learning models combined with resampling methods can predict amiodarone-induced thyroid dysfunction and serve as a support tool for individualized risk prediction and clinical decision support.

Read full abstract

Regional frequency analysis (AFR) brings together a variety of statistical methods aimed at predicting the behavior of extreme hydrological variables at ungauged sites. Regression techniques, geostatistical methods and classification are among the statistical tools frequently encountered in the literature. Methodologies based on these tools lead to regional models that offer a simple, but very useful description of the relationship between extreme hydrological variables and physiometeorological characteristics of a site. These regional models then make it possible to predict the behavior of variables of interest at places where no hydrological information is available. These methods are generally based on restrictive theoretical assumptions, including linearity and normality. These do not reflect the reality of natural phenomena. The general objectives of this paper are to identify the methods affected by these hypotheses, evaluate their impacts and propose improvements aimed at obtaining more realistic and fairer representations. Projection pursuit regression is a non-parametric method similar to generalized additive models and artificial neural networks that are considered in AFR to take into account the non-linearity of hydrological processes. In a comparative study, this paper shows that regression with revealing directions makes it possible to obtain more parsimonious models while preserving the same predictive power as the other nonparametric methods. Canonical Correlation Analysis (ACC) is used to create neighborhoods within which a model (e.g. multiple regression) is used to predict hydrologic variables at ungagged sites on the other hand, ACC strongly depends on the assumptions of normality and linearity. A new methodology for delineating neighborhoods is proposed in this paper and uses revealing direction regression to predict a reference point representing hydrological and physiometeorological information that is relevant to these groupings. The results show that the new methodology generalizes that of ACC, improves the homogeneity of neighborhoods and leads to better performance. In AFR, kriging techniques on transformed spaces are suggested in order to predict extreme hydrological variables. However, a transformation is required so that the hydrological variables of interest derive approximately from a multidimensional normal distribution. This transformation introduces a bias and leads to suboptimal predictions. Solutions have been proposed, but have not been tested in AFR. This paper proposes the approach of spatial copulas and shows that this approach provides satisfactory solutions to the problems encountered with kriging techniques. Max-stable processes are a theoretical formalization of spatial extremes and correspond to a more faithful representation of hydrological processes on the other hand; their characterization of extreme dependence poses technical problems which slow down their adoption. In this paper, the approximate Bayesian calculus is examined as a solution. The results of a simulation study show that the approximate Bayesian computation is superior to the standard approach of compound likelihood. In addition, this approach is more appropriate in order to take into account specification errors.

Read full abstract

Suboptimal Predictions Research Articles

Articles published on Suboptimal Predictions

Intelligent Financial Forecasting with Granger Causality and Correlation Analysis Using Bayesian Optimization and Long Short-Term Memory

Land Subsidence Predictions Based on a Multi-Component Temporal Convolutional Gated Recurrent Unit Model in Kunming City

UPI-LT: Enhancing Information Propagation Predictions in Social Networks Through User Influence and Temporal Dynamics

Generative adversarial networks for stack voltage degradation and RUL estimation in PEMFCs under static and dynamic loads

Struggling Models: An Analysis of Logistic Regression and Random Forest in Predicting Repeat Buyers with Imbalanced Performance Metrics

A proposed framework for crop yield prediction using hybrid feature selection approach and optimized machine learning

Hierarchical fuzzy regression functions for mixed predictors and an application to real estate price prediction

Study on the Equivalent Stiffness of a Local Resonance Metamaterial Concrete Unit Cell

Multi-source uncertainty propagation and sensitivity analysis of turbine blades with underplatform dampers

Semi-supervised Learning of Dynamical Systems with Neural Ordinary Differential Equations: A Teacher-Student Model Approach

SigCo: Eliminate the inter-class competition via sigmoid for learning with noisy labels

A socially interdependent choice framework for social influences in healthcare decision-making: a study protocol

A Multi-Fidelity Successive Response Surface Method for Crashworthiness Optimization Problems

Attention-Based Depth Distillation with 3D-Aware Positional Encoding for Monocular 3D Object Detection

A complementary and contrastive network for stimulus segmentation and generalization

Explainable Machine Learning Techniques To Predict Amiodarone-Induced Thyroid Dysfunction Risk: Multicenter, Retrospective Study With External Validation

The autonomy‐validity dilemma in mechanical prediction procedures: The quest for a compromise

Crash Injury Severity Prediction Using an Ordinal Classification Machine Learning Approach.

Predicting Future Ranked Statistics and Recorded Values for Some Statistical Distributions

Sparse data-driven wavefront prediction for large-scale adaptive optics.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Suboptimal Predictions Research Articles

Articles published on Suboptimal Predictions

Intelligent Financial Forecasting with Granger Causality and Correlation Analysis Using Bayesian Optimization and Long Short-Term Memory

Land Subsidence Predictions Based on a Multi-Component Temporal Convolutional Gated Recurrent Unit Model in Kunming City

UPI-LT: Enhancing Information Propagation Predictions in Social Networks Through User Influence and Temporal Dynamics

Generative adversarial networks for stack voltage degradation and RUL estimation in PEMFCs under static and dynamic loads

Struggling Models: An Analysis of Logistic Regression and Random Forest in Predicting Repeat Buyers with Imbalanced Performance Metrics

A proposed framework for crop yield prediction using hybrid feature selection approach and optimized machine learning

Hierarchical fuzzy regression functions for mixed predictors and an application to real estate price prediction

Study on the Equivalent Stiffness of a Local Resonance Metamaterial Concrete Unit Cell

Multi-source uncertainty propagation and sensitivity analysis of turbine blades with underplatform dampers

Semi-supervised Learning of Dynamical Systems with Neural Ordinary Differential Equations: A Teacher-Student Model Approach

SigCo: Eliminate the inter-class competition via sigmoid for learning with noisy labels

A socially interdependent choice framework for social influences in healthcare decision-making: a study protocol

A Multi-Fidelity Successive Response Surface Method for Crashworthiness Optimization Problems

Attention-Based Depth Distillation with 3D-Aware Positional Encoding for Monocular 3D Object Detection

A complementary and contrastive network for stimulus segmentation and generalization

Explainable Machine Learning Techniques To Predict Amiodarone-Induced Thyroid Dysfunction Risk: Multicenter, Retrospective Study With External Validation

The autonomy‐validity dilemma in mechanical prediction procedures: The quest for a compromise

Crash Injury Severity Prediction Using an Ordinal Classification Machine Learning Approach.

Predicting Future Ranked Statistics and Recorded Values for Some Statistical Distributions

Sparse data-driven wavefront prediction for large-scale adaptive optics.