Internal Validation Dataset Research Articles

BackgroundIn recent years, there has been a surge in machine learning-based models for diagnosis and prognostication of outcomes in oncology. However, there are concerns relating to the model’s reproducibility and generalizability to a separate patient cohort (i.e., external validation). ObjectivesThis study primarily provides a validation study for a recently introduced and publicly available machine learning (ML) web-based prognostic tool (ProgTOOL) for overall survival risk stratification of oropharyngeal squamous cell carcinoma (OPSCC). Additionally, we reviewed the published studies that have utilized ML for outcome prognostication in OPSCC to examine how many of these models were externally validated, type of external validation, characteristics of the external dataset, and diagnostic performance characteristics on the internal validation (IV) and external validation (EV) datasets were extracted and compared. Methods: We used a total of 163 OPSCC patients obtained from the Helsinki University Hospital to externally validate the ProgTOOL for generalizability. In addition, PubMed, OvidMedline, Scopus, and Web of Science databases were systematically searched according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. ResultsThe ProgTOOL produced a predictive performance of 86.5% balanced accuracy, Mathew's correlation coefficient of 0.78, Net Benefit (0.7) and Brier score (0.06) for overall survival stratification of OPSCC patients as either low-chance or high-chance. In addition, out of a total of 31 studies found to have used ML for the prognostication of outcomes in OPSCC, only seven (22.6%) reported a form of EV. Three studies (42.9%) each used either temporal EV or geographical EV while only one study (14.2%) used expert as a form of EV. Most of the studies reported a reduction in performance when externally validated. ConclusionThe performance of the model in this validation study indicates that it may be generalized, therefore, bringing recommendations of the model for clinical evaluation closer to reality. However, the number of externally validated ML-based models for OPSCC is still relatively small. This significantly limits the transfer of these models for clinical evaluation and subsequently reduces the likelihood of the use of these models in daily clinical practice. As a gold standard, we recommend the use of geographical EV and validation studies to reveal biases and overfitting of these models. These recommendations are poised to facilitate the implementation of these models in clinical practice.

Identification of convulsive epilepsy in sub-Saharan Africa relies on access to resources that are often unavailable. Infrastructure and resource requirements can further complicate case verification. Using machine-learning techniques, we have developed and tested a region-specific questionnaire panel and predictive model to identify people who have had a convulsive seizure. These findings have been implemented into a free app for health-care workers in Kenya, Uganda, Ghana, Tanzania, and South Africa. In this retrospective case-control study, we used data from the Studies of the Epidemiology of Epilepsy in Demographic Sites in Kenya, Uganda, Ghana, Tanzania, and South Africa. We randomly split these individuals using a 7:3 ratio into a training dataset and a validation dataset. We used information gain and correlation-based feature selection to identify eight binary features to predict convulsive seizures. We then assessed several machine-learning algorithms to create a multivariate prediction model. We validated the best-performing model with the internal dataset and a prospectively collected external-validation dataset. We additionally evaluated a leave-one-site-out model (LOSO), in which the model was trained on data from all sites except one that, in turn, formed the validation dataset. We used these features to develop a questionnaire-based predictive panel that we implemented into a multilingual app (the Epilepsy Diagnostic Companion) for health-care workers in each geographical region. We analysed epilepsy-specific data from 4097 people, of whom 1985 (48·5%) had convulsive epilepsy, and 2112 were controls. From 170 clinical variables, we initially identified 20 candidate predictor features. Eight features were removed, six because of negligible information gain and two following review by a panel of qualified neurologists. Correlation-based feature selection identified eight variables that demonstrated predictive value; all were associated with an increased risk of an epileptic convulsion except one. The logistic regression, support vector, and naive Bayes models performed similarly, outperforming the decision-tree model. We chose the logistic regression model for its interpretability and implementability. The area under the receiver operator curve (AUC) was 0·92 (95% CI 0·91-0·94, sensitivity 85·0%, specificity 93·7%) in the internal-validation dataset and 0·95 (0·92-0·98, sensitivity 97·5%, specificity 82·4%) in the external-validation dataset. Similar results were observed for the LOSO model (AUC 0·94, 0·93-0·96, sensitivity 88·2%, specificity 95·3%). On the basis of these findings, we developed the Epilepsy Diagnostic Companion as a predictive model and app offering a validated culture-specific and region-specific solution to confirm the diagnosis of a convulsive epileptic seizure in people with suspected epilepsy. The questionnaire panel is simple and accessible for health-care workers without specialist knowledge to administer. This tool can be iteratively updated and could lead to earlier, more accurate diagnosis of seizures and improve care for people with epilepsy. The Wellcome Trust, the UK National Institute of Health Research, and the Oxford NIHR Biomedical Research Centre.

Internal Validation Dataset Research Articles

Related Topics

Articles published on Internal Validation Dataset

Development and Validation of a Nomogram for Preoperative Prediction of Early Recurrence after Upfront Surgery in Pancreatic Ductal Adenocarcinoma by Integrating Deep Learning and Radiological Variables.

Comprehensive bioinformatics analysis reveals the crosstalk genes and immune relationship between the systemic lupus erythematosus and venous thromboembolism.

Detecting Paroxysmal Atrial Fibrillation From an Electrocardiogram in Sinus Rhythm: External Validation of the AI Approach

Predicting emergency department visits among children with asthma in two academic medical systems

Development and validation of explainable machine-learning models for carotid atherosclerosis early screening

Differences of survival benefits brought by various treatments in ovarian cancer patients with different tumor stages

Determination and characterization of molecular heterogeneity and precision medicine strategies of patients with pancreatic cancer and pancreatic neuroendocrine tumor based on oxidative stress and mitochondrial dysfunction-related genes.

CT-based radiomics can identify physiological modifications of bone structure related to subjects' age and sex.

Machine learning-based prediction model for postoperative delirium in non-cardiac surgery

Development of a Bispectral index score prediction model based on an interpretable deep learning algorithm

Classification of Hypoglycemic Events in Type 1 Diabetes Using Machine Learning Algorithms.

Application of artificial intelligence for overall survival risk stratification in oropharyngeal carcinoma: A validation of ProgTOOL

Development and validation of a diagnostic aid for convulsive epilepsy in sub-Saharan Africa: a retrospective case-control study.

Tectonic infarct analysis: A computational tool for automated whole-brain infarct analysis from TTC-stained tissue

An immune-related gene signature predicts the 28-day mortality in patients with sepsis.

Deep learning for real-time detection of breast cancer presenting pathological nipple discharge by ductoscopy.

A Deep Learning Radiomics Nomogram to Predict Response to Neoadjuvant Chemotherapy for Locally Advanced Cervical Cancer: A Two-Center Study

Artificial intelligence-aided method to detect uterine fibroids in ultrasound images: a retrospective study

Predictive factors of microvascular invasion in patients with intrahepatic mass-forming cholangiocarcinoma based on magnetic resonance images.

Nomograms for Predicting Survival Outcomes in Patients with Neuroendocrine Neoplasms of the Gallbladder Undergoing Primary Tumor Resection: A Population-Based Study.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Internal Validation Dataset Research Articles

Related Topics

Articles published on Internal Validation Dataset

Development and Validation of a Nomogram for Preoperative Prediction of Early Recurrence after Upfront Surgery in Pancreatic Ductal Adenocarcinoma by Integrating Deep Learning and Radiological Variables.

Comprehensive bioinformatics analysis reveals the crosstalk genes and immune relationship between the systemic lupus erythematosus and venous thromboembolism.

Detecting Paroxysmal Atrial Fibrillation From an Electrocardiogram in Sinus Rhythm: External Validation of the AI Approach

Predicting emergency department visits among children with asthma in two academic medical systems

Development and validation of explainable machine-learning models for carotid atherosclerosis early screening

Differences of survival benefits brought by various treatments in ovarian cancer patients with different tumor stages

Determination and characterization of molecular heterogeneity and precision medicine strategies of patients with pancreatic cancer and pancreatic neuroendocrine tumor based on oxidative stress and mitochondrial dysfunction-related genes.

CT-based radiomics can identify physiological modifications of bone structure related to subjects' age and sex.

Machine learning-based prediction model for postoperative delirium in non-cardiac surgery

Development of a Bispectral index score prediction model based on an interpretable deep learning algorithm

Classification of Hypoglycemic Events in Type 1 Diabetes Using Machine Learning Algorithms.

Application of artificial intelligence for overall survival risk stratification in oropharyngeal carcinoma: A validation of ProgTOOL

Development and validation of a diagnostic aid for convulsive epilepsy in sub-Saharan Africa: a retrospective case-control study.

Tectonic infarct analysis: A computational tool for automated whole-brain infarct analysis from TTC-stained tissue

An immune-related gene signature predicts the 28-day mortality in patients with sepsis.

Deep learning for real-time detection of breast cancer presenting pathological nipple discharge by ductoscopy.

A Deep Learning Radiomics Nomogram to Predict Response to Neoadjuvant Chemotherapy for Locally Advanced Cervical Cancer: A Two-Center Study

Artificial intelligence-aided method to detect uterine fibroids in ultrasound images: a retrospective study

Predictive factors of microvascular invasion in patients with intrahepatic mass-forming cholangiocarcinoma based on magnetic resonance images.

Nomograms for Predicting Survival Outcomes in Patients with Neuroendocrine Neoplasms of the Gallbladder Undergoing Primary Tumor Resection: A Population-Based Study.