Traditional Statistical Models Research Articles

Abstract Introduction Machine learning (ML) and Deep learning (DL) neural networks have been used to predict which patients will respond to atrial fibrillation (AF) ablation. However, the predictive results of DL are modest compared to traditional statistical models of clinical features with or without intracardiac electrograms. Purpose To test the hypothesis that optimized ML architectures will better identify patients who respond to AF ablation using intracardiac EGM than other DL architectures in our large registry of 320 consecutive patients. Methods Our patients had age 65.1±10.4Y, were 25% women, 61.6% non-paroxysmal, with intracardiac EGM recorded from multipolar catheters and 64-pole baskets (Fig A). Patients were propensity matched into those in whom ablation terminated by ablation (N=160, "Term") and those without termination (N=160, "Non-Term"). Using a proven DL architecture, ResNet, we systematically varied the depth (i.e., number of residual layers (D)) and width (i.e., number of nodes per layer (W)). We predicted termination (Fig C), presenting EGMs to training cohorts and testing in separate cohorts in a 6-fold cross-validation which we repeated 10 times for each fold with different randomization. Results Fig B illustrates that the prediction accuracy is generally negatively correlated with the network depth but positively correlated with the width. Specifically, the deep architecture (D=4, W=256, with 265M trainable parameters, blue-circled in Fig B) predicted with AUC = 0.61±0.08. However, a shallow-narrow architecture (D=1, W=16, green-circled in Fig B) achieved same AUC (0.61±0.09, p=0.65), while reducing the number of parameters by 97% compared to the DL architecture (79K vs. 265M). Additionally, a shallow-wide architecture (D=1, W=2048, red-circled in Fig B) achieved higher AUC (0.74±0.08; p&lt;0.0001), while saving 44% of the parameters (148M vs 265M) and 26% training time compared to the deep architecture. Conclusion Shallow neural network classification of raw EGMs predicted ablation response with better accuracy than other architectures. Future work should determine if deep architectures ultimately ‘overfit’ detailed EGM data, which may explain their lower success.Figure

Read full abstract

Breast cancer is the most common cancer and the most common cause of cancer death in women. Although survival rates have improved, unmet psychosocial needs remain challenging because the quality of life (QoL) and QoL-related factors change over time. In addition, traditional statistical models have limitations in identifying factors associated with QoL over time, particularly concerning the physical, psychological, economic, spiritual, and social dimensions. This study aimed to identify patient-centered factors associated with QoL among patients with breast cancer using a machine learning (ML) algorithm to analyze data collected along different survivorship trajectories. The study used 2 data sets. The first data set was the cross-sectional survey data from the Breast Cancer Information Grand Round for Survivorship (BIG-S) study, which recruited consecutive breast cancer survivors who visited the outpatient breast cancer clinic at the Samsung Medical Center in Seoul, Korea, between 2018 and 2019. The second data set was the longitudinal cohort data from the Beauty Education for Distressed Breast Cancer (BEST) cohort study, which was conducted at 2 university-based cancer hospitals in Seoul, Korea, between 2011 and 2016. QoL was measured using European Organization for Research and Treatment of Cancer QoL Questionnaire Core 30 questionnaire. Feature importance was interpreted using Shapley Additive Explanations (SHAP). The final model was selected based on the highest mean area under the receiver operating characteristic curve (AUC). The analyses were performed using the Python 3.7 programming environment (Python Software Foundation). The study included 6265 breast cancer survivors in the training data set and 432 patients in the validation set. The mean age was 50.6 (SD 8.66) years and 46.8% (n=2004) had stage 1 cancer. In the training data set, 48.3% (n=3026) of survivors had poor QoL. The study developed ML models for QoL prediction based on 6 algorithms. Performance was good for all survival trajectories: overall (AUC 0.823), baseline (AUC 0.835), within 1 year (AUC 0.860), between 2 and 3 years (AUC 0.808), between 3 and 4 years (AUC 0.820), and between 4 and 5 years (AUC 0.826). Emotional and physical functions were the most important features before surgery and within 1 year after surgery, respectively. Fatigue was the most important feature between 1 and 4 years. Despite the survival period, hopefulness was the most influential feature on QoL. External validation of the models showed good performance with AUCs between 0.770 and 0.862. The study identified important factors associated with QoL among breast cancer survivors across different survival trajectories. Understanding the changing trends of these factors could help to intervene more precisely and timely, and potentially prevent or alleviate QoL-related issues for patients. The good performance of our ML models in both training and external validation sets suggests the potential use of this approach in identifying patient-centered factors and improving survivorship care.

Read full abstract

Traditional Statistical Models Research Articles

Articles published on Traditional Statistical Models

Sample size and predictive performance of machine learning methods with survival data: A simulation study.

Forecasting the Acute Heart Failure Admissions: Development of Deep Learning Prediction Model Incorporating the Climate Information

Predicting success of atrial fibrillation ablation: comparing machine learning approaches of intracardiac electrograms

Intra-day solar irradiation forecast using machine learning with satellite data

Retrieval of suspended sediment concentrations using remote sensing and machine learning methods: A case study of the lower Yellow River

Application of Machine Learning Based on Structured Medical Data in Gastroenterology.

Prediction for the Sluice Deformation Based on SOA-LSTM-Weighted Markov Model

Monthly Runoff Prediction by Combined Models Based on Secondary Decomposition at the Wulong Hydrological Station in the Yangtze River Basin

Deep learning methods used in movie recommendation systems

Accident severity prediction modeling for road safety using random forest algorithm: an analysis of Indian highways.

How does the built environment affect hotel prices? A study using multiscale GWR and deep learning

Quality by Design (QbD) and Design of Experiments (DOE) as a Strategy for Tuning Lipid Nanoparticle Formulations for RNA Delivery.

A novel deep-learning technique for forecasting oil price volatility using historical prices of five precious metals in context of green financing – A comparison of deep learning, machine learning, and statistical models

Deep learning models for price forecasting of financial time series: A review of recent advancements: 2020–2022

A methodological study of exposome based on an open database: Association analysis between exposure to metal mixtures and hyperuricemia

Artificial neural networks for predicting mechanical properties of Al2219-B4C-Gr composites with multireinforcements

Deep Learning-Enabled Statistical Model Estimation for Power Transformers with Censoring and Truncation Problems

Machine learning applications and challenges in graft-versus-host disease: a scoping review.

Prediction Model for Postoperative Quality of Life Among Breast Cancer Survivors Along the Survivorship Trajectory From Pretreatment to 5 Years: Machine Learning-Based Analysis.

Leakage and the reproducibility crisis in machine-learning-based science

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Traditional Statistical Models Research Articles

Articles published on Traditional Statistical Models

Sample size and predictive performance of machine learning methods with survival data: A simulation study.

Forecasting the Acute Heart Failure Admissions: Development of Deep Learning Prediction Model Incorporating the Climate Information

Predicting success of atrial fibrillation ablation: comparing machine learning approaches of intracardiac electrograms

Intra-day solar irradiation forecast using machine learning with satellite data

Retrieval of suspended sediment concentrations using remote sensing and machine learning methods: A case study of the lower Yellow River

Application of Machine Learning Based on Structured Medical Data in Gastroenterology.

Prediction for the Sluice Deformation Based on SOA-LSTM-Weighted Markov Model

Monthly Runoff Prediction by Combined Models Based on Secondary Decomposition at the Wulong Hydrological Station in the Yangtze River Basin

Deep learning methods used in movie recommendation systems

Accident severity prediction modeling for road safety using random forest algorithm: an analysis of Indian highways.

How does the built environment affect hotel prices? A study using multiscale GWR and deep learning

Quality by Design (QbD) and Design of Experiments (DOE) as a Strategy for Tuning Lipid Nanoparticle Formulations for RNA Delivery.

A novel deep-learning technique for forecasting oil price volatility using historical prices of five precious metals in context of green financing – A comparison of deep learning, machine learning, and statistical models

Deep learning models for price forecasting of financial time series: A review of recent advancements: 2020–2022

A methodological study of exposome based on an open database: Association analysis between exposure to metal mixtures and hyperuricemia

Artificial neural networks for predicting mechanical properties of Al2219-B4C-Gr composites with multireinforcements

Deep Learning-Enabled Statistical Model Estimation for Power Transformers with Censoring and Truncation Problems

Machine learning applications and challenges in graft-versus-host disease: a scoping review.

Prediction Model for Postoperative Quality of Life Among Breast Cancer Survivors Along the Survivorship Trajectory From Pretreatment to 5 Years: Machine Learning-Based Analysis.

Leakage and the reproducibility crisis in machine-learning-based science