Types Of Machine Learning Models Research Articles

Abstract Study question Can advanced machine learning applied to the preoperative assessment predict the testicular sperm extraction outcome in azoospermic context and how many patients are required? Summary answer Despite encouraging results (AUC = 92.0%, sensitivity = 83.9% and specificity = 84.2%), integrating new biomarkers would probably be more relevant than enrolling additional patients. What is known already Testicular sperm extraction (TESE) is an essential therapeutic tool for the male infertility management and is often the “last hope” before gamete donation for these patients. However, it is an invasive procedure and is successful in up to 50%. Until now, no model is sufficiently powerful to accurately predict the success of sperm retrieval in TESE. Among the few models already developed, the findings are highly disparate despite having common input data (preoperative assessment). Moreover, only few types of machine learning models and procedures have been investigated. Performances were mostly capped despite the inclusion sometimes of more than 1000 patients. Study design, size, duration Data of 175 patients who underwent TESE between 2012 and 2021 were retrospectively analyzed. The performances of a wide range of preprocessing methods and machine learning models (state-of-the-art methods in machine learning) we explored, evaluated, and compared. The objective was to predict the presence or absence of spermatozoa, using 17 parameters (clinical, hormonal, genetic, history) from the preoperative assessment. The study protocol was approved by a local ethics committee (IRB CER-2021-041). Participants/materials, setting, methods After data preprocessing (standardization…), Machine Learning models (Bayesian Naive Classification, logistic regression, k-nearest neighbor classifier, support vector machine, random forests, GradientBoosting and XGBoost) and Deep Learning models were tested. The validation procedure consisted of splitting the dataset into a training set and test set. Beyond the standard metrics (sensitivity, specificity, AUC-ROC), the identification of the most relevant variables and the learning curve to determine the optimal patient number to be included were performed. Main results and the role of chance At least one live spermatozoon was found in the testicular tissue of 104 (59.4%) patients (positive TESE) out of 175. The best performing model (Random Forest with appropriate preprocessing) obtained the following results on the test set: AUC = 92.0%, sensitivity = 83.9% and specificity = 84.2%, leading to an efficient tool, which gives additional and more relevant information than the different variables taken separately. Inhibin B, FSH and history of cryptorchidism were the variables with the most discriminating power. However, a plateau in the model performance was observed (beyond 110 patients), whatever the approach or the preprocessing used. A trend curve shows that beyond 110 patients, no improvement can be observed and cast doubt about the power of the traditional preoperative parameters assessed before TESE. The classic preoperative assessment can probably not fully predict the TESE outcomes. Further work is needed to be enhance with new hypothesis and the use of new biomarkers to be integrated into the models. Limitations, reasons for caution The main limitation was the monocentric design and the use of retrospective data. Wider implications of the findings Machine learning models can provide the basis for an enhanced decision support system tool in the context of azoospermia. Indefinitely increasing the number of participants is not likely to be the solution: further hypotheses and biomarkers integration into the models will probably be necessary to improve performance. Trial registration number not applicable

Abstract mRNA transcriptomic markers have become a key tool in the classification of breast cancer and deciding treatment regimens for patients. However, limited studies have evaluated markers that might be predictive of response to neoadjuvant treatment (chemotherapy and endocrine treatment). Defining such key markers that are associated with treatment response could offer new insights on the relative molecular mechanisms and provide more tailored treatment regimens to target pathways that might be over-represented. We aimed to identify mRNA expression markers associated with breast cancer patients’ response to neoadjuvant treatment from a pool of studies in which tumor samples were collected at pre-treatment and on-treatment timepoints. We collated 1194 pre- and on-treatment samples (721 unique patients) from 9 publicly available gene expression datasets that met our inclusion criteria. The standardized gene expression values from each study were merged in a global matrix of 1551 genes and 1194 samples. Differential Gene Expression Analysis adjusted for timepoint, pam50 subtype (as predicted by the genefu package in R), treatment and batch effects was conducted. 14 significantly (FDR = 0.05) differentially expressed were identified (CCNA2, RFC3, VRK1, PSMB2, ALDH7A1, RRM2, ADRM1, TSPAN4, ZNF473, RNH1, HDAC1, CDK1, SMC1A and TOR1AIP1) when responders and non-responders were compared, some of which are known drug targets (e.g. PSMB2/carfilzomib). Using the set of identified genes we performed Monte-Carlo consensus clustering on the full set (optimal number of clusters: 6). Clusters were significantly associated with response (p = 2·10-8), timepoint (p &lt; 10-15) and pam50 subtype (p &lt; 10-15), but not treatment (p = 0.35). We then split our data into a training, a validation and a test set (776, 299, 119 samples respectively) and used the 14 genes (alone or in combination with metadata) as predictors to fit three types of machine learning models (lasso-regularized Logistic Regression, Decision Trees and Support Vector Machines). Support Vector Machines demonstrated the best classification performance on the validation set (75% classification accuracy) and achieved accuracy of 80% on the test set. Pathway analysis based on the identified genes revealed enriched nuclear membrane organization, protein deubiquitination, DNA replication and histone modification pathways. Prediction of response to treatment at baseline or mid-treatment can aid in patient stratification in the neoadjuvant setting and separate patients who would benefit from treatment the most and could undergo a less extensive surgical operation from patients who are unlikely to respond and should be scheduled for surgery sooner. This kind of early intervention has the potential to lead to improved patient outcomes and reduced side effects from unnecessary treatment administration. Citation Format: Aristeidis Sionakidis, Jonine D. Figueroa, Timothy I. Cannings. A novel 14-gene signature to predict response to neoadjuvant chemotherapy and endocrine treatment in breast cancer patients [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2022; 2022 Apr 8-13. Philadelphia (PA): AACR; Cancer Res 2022;82(12_Suppl):Abstract nr 1233.

Types Of Machine Learning Models Research Articles

Related Topics

Articles published on Types Of Machine Learning Models

Alternative stopping rules to limit tree expansion for random forest models

Impact Assessment of COVID-19 Lockdown on Vertical Distributions of NO2 and HCHO From MAX-DOAS Observations and Machine Learning Models.

Detection of Fake News Based on Typical Machine Learning Models

Artificial Intelligence as an Actor in Innovation Teams: An Assessment of the GPT-3 Language Model

An Automated System for Fruit Adulteration and Fruit Grading

P-057 Machine learning-based prediction of testicular sperm extraction: comparison of different preprocessing and models, required sample size and relevance of input biomarkers

Abstract 1233: A novel 14-gene signature to predict response to neoadjuvant chemotherapy and endocrine treatment in breast cancer patients

XAI in the Context of Predictive Process Monitoring: An Empirical Analysis Framework

A Robust Approach of Multi-sensor Fusion for Fault Diagnosis Using Convolution Neural Network

Compressive Strength Estimation of Geopolymer Composites through Novel Computational Approaches.

HelixADMET: a robust and endpoint extensible ADMET system incorporating self-supervised knowledge transfer.

Prediction of the trajectories of depressive symptoms among children in the adolescent brain cognitive development (ABCD) study using machine learning approach

OTA-TinyML: Over the Air Deployment of TinyML Models and Execution on IoT Devices

Stochastic integrated machine learning based multiscale approach for the prediction of the thermal conductivity in carbon nanotube reinforced polymeric composites

Chimney Identification Tool for Automated Detection of Hydrothermal Chimneys from High-Resolution Bathymetry Using Machine Learning

Hospital Length of Stay and 30-Day Mortality Prediction in Stroke: A Machine Learning Analysis of 17,000 ICU Admissions in Brazil.

Estimating Evapotranspiration of Screenhouse Banana Plantations Using Artificial Neural Network and Multiple Linear Regression Models

Using machine learning for particle track identification in the CLAS12 detector

Prediction of Myocardial Infarction From Patient Features With Machine Learning.

Groundwater level prediction using machine learning models: A comprehensive review

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Types Of Machine Learning Models Research Articles

Related Topics

Articles published on Types Of Machine Learning Models

Alternative stopping rules to limit tree expansion for random forest models

Impact Assessment of COVID-19 Lockdown on Vertical Distributions of NO2 and HCHO From MAX-DOAS Observations and Machine Learning Models.

Detection of Fake News Based on Typical Machine Learning Models

Artificial Intelligence as an Actor in Innovation Teams: An Assessment of the GPT-3 Language Model

An Automated System for Fruit Adulteration and Fruit Grading

P-057 Machine learning-based prediction of testicular sperm extraction: comparison of different preprocessing and models, required sample size and relevance of input biomarkers

Abstract 1233: A novel 14-gene signature to predict response to neoadjuvant chemotherapy and endocrine treatment in breast cancer patients

XAI in the Context of Predictive Process Monitoring: An Empirical Analysis Framework

A Robust Approach of Multi-sensor Fusion for Fault Diagnosis Using Convolution Neural Network

Compressive Strength Estimation of Geopolymer Composites through Novel Computational Approaches.

HelixADMET: a robust and endpoint extensible ADMET system incorporating self-supervised knowledge transfer.

Prediction of the trajectories of depressive symptoms among children in the adolescent brain cognitive development (ABCD) study using machine learning approach

OTA-TinyML: Over the Air Deployment of TinyML Models and Execution on IoT Devices

Stochastic integrated machine learning based multiscale approach for the prediction of the thermal conductivity in carbon nanotube reinforced polymeric composites

Chimney Identification Tool for Automated Detection of Hydrothermal Chimneys from High-Resolution Bathymetry Using Machine Learning

Hospital Length of Stay and 30-Day Mortality Prediction in Stroke: A Machine Learning Analysis of 17,000 ICU Admissions in Brazil.

Estimating Evapotranspiration of Screenhouse Banana Plantations Using Artificial Neural Network and Multiple Linear Regression Models

Using machine learning for particle track identification in the CLAS12 detector

Prediction of Myocardial Infarction From Patient Features With Machine Learning.

Groundwater level prediction using machine learning models: A comprehensive review