Application of machine learning in migraine classification: a call for study design standardization and global collaboration

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon

Migraine is a complex neurological disorder with diverse clinical phenotypes and a multifaceted pathophysiology, which poses substantial challenges for accurate diagnosis, subtype differentiation, and biomarker discovery. Machine learning (ML) techniques have emerged as promising tools for classifying migraine patients and uncovering the underlying neurobiological mechanisms that differentiate migraine types and subtypes. This systematic review identifies current ML classification models for migraine types and subtypes, evaluating the quality, reproducibility, and clinical utility of published studies. The findings demonstrate that current ML models, particularly support vector machines and linear discriminant analysis, can accurately classify migraine patients based on structural and functional neuroimaging features with accuracies ranging from 75 to 98%. However, quality assessment revealed significant methodological heterogeneity across studies, including inconsistent reporting of model performance, insufficient patient phenotyping, small and imbalanced datasets, and limited external validation. These limitations hinder the global generalizability and reproducibility of these studies. We propose a roadmap for future research emphasizing well-characterized clinical subgrouping, standardized data acquisition and feature engineering protocols, transparency in model development and reporting, and collaborative multicentric designs to enable large-scale validation. Furthermore, this review stresses the importance of incorporating real-world phenotypic data, such as treatment response, comorbidities, and digital phenotyping metrics, to enrich ML models and support the transition toward precision medicine in migraine care. Ultimately, this review highlights the urgent need for methodological rigor in migraine ML classification studies to bridge the gap between experimental success and clinical applicability.Graphical Supplementary InformationThe online version contains supplementary material available at 10.1186/s10194-025-02134-9.

Similar Papers
  • Research Article
  • Cite Count Icon 255
  • 10.1103/physrevlett.126.190505
Information-Theoretic Bounds on Quantum Advantage in Machine Learning.
  • May 14, 2021
  • Physical Review Letters
  • Hsin-Yuan Huang + 2 more

We study the performance of classical and quantum machine learning (ML) models in predicting outcomes of physical experiments. The experiments depend on an input parameter x and involve execution of a (possibly unknown) quantum process E. Our figure of merit is the number of runs of E required to achieve a desired prediction performance. We consider classical ML models that perform a measurement and record the classical outcome after each run of E, and quantum ML models that can access E coherently to acquire quantum data; the classical or quantum data are then used to predict the outcomes of future experiments. We prove that for any input distribution D(x), a classical ML model can provide accurate predictions on average by accessing E a number of times comparable to the optimal quantum ML model. In contrast, for achieving an accurate prediction on all inputs, we prove that the exponential quantum advantage is possible. For example, to predict the expectations of all Pauli observables in an n-qubit system ρ, classical ML models require 2^{Ω(n)} copies of ρ, but we present a quantum ML model using only O(n) copies. Our results clarify where the quantum advantage is possible and highlight the potential for classical ML models to address challenging quantum problems in physics and chemistry.

  • Research Article
  • 10.1186/s12873-025-01468-6
AI-powered surveillance of bronchiolitis in the Nirsevimab era: comparative performance of machine learning, deep learning, and large language models on free-text ED records.
  • Mar 10, 2026
  • BMC emergency medicine
  • Marianna Costa + 5 more

Bronchiolitis remains a frequent reason for hospitalization in infants during the winter season. Epidemiologic surveillance remains crucial in the era of widespread immunoprophylaxis for the leading viral agent causing bronchiolitis. We investigated the performance of classical machine learning (ML) models, Deep Learning (DL), and a pre-trained large language model (LLM) in classifying bronchiolitis diagnosis from the free-text-diagnosis field of the emergency department electronic health records (EHRs). As a secondary aim, we evaluated the diagnostic accuracy of the actual official administrative ICD-9 encoding for Bronchiolitis diagnosis. 28,557 records of infants < 1 year with complete discharge diagnoses fields were retrieved between the years 2007–2018 and manually classified by an expert pediatrician to create the gold standard diagnosis set for training the algorithm. After data pre-processing, classical ML models (Random Forest, Decision Tree, Gradient Boosting Machine, Linear Discriminant Analysis, Support Vector Machine), a Deep Learning (DL) tool, and a pre-trained LLM (GPT-5) were evaluated using balanced accuracy, sensitivity, and F1 scores. The official administrative ICD-9 encoding classification accuracy was compared to the gold standard. Overall, 1,903 of 28,557 records (6.7%) were classified as bronchiolitis by the gold standard approach. The DL model and GPT-5 outperformed traditional ML models, achieving higher sensitivities (0.97, 95%CI 0.96-1.00, and 0.98, 95% CI 0.98–0.99, respectively), F1 scores (0.96, 95% CI 0.95–0.99, and 0.99, 95% CI 0.98–0.99, respectively), and balanced accuracy (0.98, 95%CI 0.98-1.00, and 0.99, 95% CI 0.99–0.99, respectively). Traditional ML models showed sensitivities between 0.77 and 0.98, F1 scores between 0.86 and 0.96, and balanced accuracies between 0.88 and 0.96. ICD-9 codes showed sensitivity of 85.9% (95% CI 84.27–87.45), and specificity of 98.5% (95% CI 98.36–98.65). To our knowledge, this is the first study directly comparing an LLM, deep learning, and multiple classical ML models for bronchiolitis surveillance in the post-Nirsevimab era. DL and GPT-5 outperformed traditional ML-based tools in identifying bronchiolitis diagnoses and ICD-9 diagnosis coding. AI-based tools hold significant potential for improving epidemiologic surveillance of bronchiolitis from emergency department EHRs. Not applicable.

  • Research Article
  • Cite Count Icon 27
  • 10.1002/cpe.7190
Hyper‐parametric improved machine learning models for solar radiation forecasting
  • Jul 26, 2022
  • Concurrency and Computation: Practice and Experience
  • Mantosh Kumar + 2 more

SummarySpatiotemporal solar radiation forecasting is extremely challenging due to its dependence on metrological and environmental factors. Chaotic time‐varying and non‐linearity make the forecasting model more complex. To cater this crucial issue, the paper provides a comprehensive investigation of the deep learning framework for the prediction of the two components of solar irradiation, that is, Diffuse Horizontal Irradiance (DHI) and Direct Normal Irradiance (DNI). Through exploratory data analysis the three recent most prominent deep learning (DL) architecture have been developed and compared with the other classical machine learning (ML) models in terms of the statistical performance accuracy. In our study, DL architecture includes convolutional neural network (CNN) and recurrent neural network (RNN) whereas classical ML models include Random Forest (RF), Support Vector Regression (SVR), Multilayer Perceptron (MLP), Extreme Gradient Boosting (XGB), and K‐Nearest Neighbor (KNN). Additionally, three optimization techniques Grid Search (GS), Random Search (RS), and Bayesian Optimization (BO) have been incorporated for tuning the hyper parameters of the classical ML models to obtain the best results. Based on the rigorous comparative analysis it was found that the CNN model has outperformed all classical machine learning and DL models having lowest mean squared error and highest R‐Squared value with least computational time.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 2
  • 10.3390/s23156833
Failure Severity Prediction for Protective-Coating Disbondment via the Classification of Acoustic Emission Signals.
  • Jul 31, 2023
  • Sensors
  • Noor A’In A Rahman + 3 more

Structural health monitoring is a popular inspection method that utilizes acoustic emission (AE) signals for fault detection in engineering infrastructures. Diagnosis based on the propagation of AE signals along any surface material offers an attractive solution for fault identification. However, the classification of AE signals originating from failure events, especially coating failure (coating disbondment), is a challenging task given the AE signature of each material. Thus, different experimental settings and analyses of AE signals are required to classify the various types of coating failures, and they are time-consuming and expensive. Hence, to address these issues, we utilized machine learning (ML) classification models in this work to evaluate epoxy-based-protective-coating disbondment based on the AE principle. A coating disbondment experiment consisting of coated carbon steel test panels for the collection of AE signals was implemented. The obtained AE signals were then processed to construct the final dataset to train various state-of-the-art ML classification models to divide the failure severity of coating disbondment into three classes. Consequently, methods for the extraction of useful features, the handling of data imbalance, and a reduction in the bias of ML models were also effectively utilized in this study. Evaluations of state-of-the-art ML classification models on the AE signal dataset in terms of standard metrics revealed that the decision forest classification model outperformed the other state-of-the-art models, with accuracy, precision, recall, and F1 score values of 99.48%, 98.76%, 97.58%, and 98.17%, respectively. These results demonstrate the effectiveness of utilizing ML classification models for the failure severity prediction of protective-coating defects via AE signals.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 21
  • 10.3390/rs16091582
Integrating Artificial Intelligence and UAV-Acquired Multispectral Imagery for the Mapping of Invasive Plant Species in Complex Natural Environments
  • Apr 29, 2024
  • Remote Sensing
  • Narmilan Amarasingam + 4 more

The proliferation of invasive plant species poses a significant ecological threat, necessitating effective mapping strategies for control and conservation efforts. Existing studies employing unmanned aerial vehicles (UAVs) and multispectral (MS) sensors in complex natural environments have predominantly relied on classical machine learning (ML) models for mapping plant species in natural environments. However, a critical gap exists in the literature regarding the use of deep learning (DL) techniques that integrate MS data and vegetation indices (VIs) with different feature extraction techniques to map invasive species in complex natural environments. This research addresses this gap by focusing on mapping the distribution of the Broad-leaved pepper (BLP) along the coastal strip in the Sunshine Coast region of Southern Queensland in Australia. The methodology employs a dual approach, utilising classical ML models including Random Forest (RF), eXtreme Gradient Boosting (XGBoost), and Support Vector Machine (SVM) in conjunction with the U-Net DL model. This comparative analysis allows for an in-depth evaluation of the performance and effectiveness of both classical ML and advanced DL techniques in mapping the distribution of BLP along the coastal strip. Results indicate that the DL U-Net model outperforms classical ML models, achieving a precision of 83%, recall of 81%, and F1–score of 82% for BLP classification during training and validation. The DL U-Net model attains a precision of 86%, recall of 76%, and F1–score of 81% for BLP classification, along with an Intersection over Union (IoU) of 68% on the separate test dataset not used for training. These findings contribute valuable insights to environmental conservation efforts, emphasising the significance of integrating MS data with DL techniques for the accurate mapping of invasive plant species.

  • Research Article
  • Cite Count Icon 15
  • 10.1016/j.aap.2017.10.020
Improving autocoding performance of rare categories in injury classification: Is more training data or filtering the solution?
  • Nov 8, 2017
  • Accident Analysis &amp; Prevention
  • Gaurav Nanda + 2 more

Improving autocoding performance of rare categories in injury classification: Is more training data or filtering the solution?

  • Research Article
  • Cite Count Icon 5
  • 10.1016/j.xops.2023.100385
Machine Learning to Predict Faricimab Treatment Outcome in Neovascular Age-Related Macular Degeneration
  • Aug 18, 2023
  • Ophthalmology Science
  • Yusuke Kikuchi + 6 more

PurposeTo develop machine learning (ML) models to predict, at baseline, treatment outcomes at month 9 in patients with neovascular age-related macular degeneration (nAMD) receiving faricimab.DesignRetrospective proof of concept study.ParticipantsPatients enrolled in the phase II AVENUE trial (NCT02484690) of faricimab in nAMD.MethodsBaseline characteristics and spectral domain-OCT (SD-OCT) image data from 185 faricimab-treated eyes were split into 80% training and 20% test sets at the patient level. Input variables were baseline age, sex, best-corrected visual acuity (BCVA), central subfield thickness (CST), low luminance deficit, treatment arm, and SD-OCT images. A regression problem (BCVA) and a binary classification problem (reduction of CST by 35%) were considered. Overall, 10 models were developed and tested for each problem. Benchmark classical ML models (linear, random forest, extreme gradient boosting) were trained on baseline characteristics; benchmark deep neural networks (DNNs) were trained on baseline SD-OCT B-scans. Baseline characteristics and SD-OCT data were merged using 2 approaches: model stacking (using DNN prediction as an input feature for classical ML models) and model averaging (which averaged predictions from the DNN using SD-OCT volume and from classical ML models using baseline characteristics).Main Outcome MeasuresTreatment outcomes were defined by 2 target variables: functional (BCVA letter score) and anatomical (percent decrease in CST from baseline) outcomes at month 9.ResultsThe best-performing BCVA regression model with respect to the test coefficient of determination (R2) was the linear model in the model-stacking approach with R2 of 0.31. The best-performing CST classification model with respect to test area under receiver operating characteristics (AUROC) was the benchmark linear model with AUROC of 0.87. A post hoc analysis showed the baseline BCVA and the baseline CST had the most effect in the all-model prediction for BCVA regression and CST classification, respectively.ConclusionsPromising signals for predicting treatment outcomes from baseline characteristics were detected; however, the predictive benefit of baseline images was unclear in this proof-of-concept study. Further testing and validation with larger, independent datasets is required to fully explore the predictive capacity of ML models using baseline imaging data.Financial Disclosure(s)Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 26
  • 10.3390/w14071032
Stream Temperature Predictions for River Basin Management in the Pacific Northwest and Mid-Atlantic Regions Using Machine Learning
  • Mar 24, 2022
  • Water
  • Helen Weierbach + 6 more

Stream temperature (Ts) is an important water quality parameter that affects ecosystem health and human water use for beneficial purposes. Accurate Ts predictions at different spatial and temporal scales can inform water management decisions that account for the effects of changing climate and extreme events. In particular, widespread predictions of Ts in unmonitored stream reaches can enable decision makers to be responsive to changes caused by unforeseen disturbances. In this study, we demonstrate the use of classical machine learning (ML) models, support vector regression and gradient boosted trees (XGBoost), for monthly Ts predictions in 78 pristine and human-impacted catchments of the Mid-Atlantic and Pacific Northwest hydrologic regions spanning different geologies, climate, and land use. The ML models were trained using long-term monitoring data from 1980–2020 for three scenarios: (1) temporal predictions at a single site, (2) temporal predictions for multiple sites within a region, and (3) spatiotemporal predictions in unmonitored basins (PUB). In the first two scenarios, the ML models predicted Ts with median root mean squared errors (RMSE) of 0.69–0.84 °C and 0.92–1.02 °C across different model types for the temporal predictions at single and multiple sites respectively. For the PUB scenario, we used a bootstrap aggregation approach using models trained with different subsets of data, for which an ensemble XGBoost implementation outperformed all other modeling configurations (median RMSE 0.62 °C).The ML models improved median monthly Ts estimates compared to baseline statistical multi-linear regression models by 15–48% depending on the site and scenario. Air temperature was found to be the primary driver of monthly Ts for all sites, with secondary influence of month of the year (seasonality) and solar radiation, while discharge was a significant predictor at only 10 sites. The predictive performance of the ML models was robust to configuration changes in model setup and inputs, but was influenced by the distance to the nearest dam with RMSE &lt;1 °C at sites situated greater than 16 and 44 km from a dam for the temporal single site and regional scenarios, and over 1.4 km from a dam for the PUB scenario. Our results show that classical ML models with solely meteorological inputs can be used for spatial and temporal predictions of monthly Ts in pristine and managed basins with reasonable (&lt;1 °C) accuracy for most locations.

  • Dissertation
  • Cite Count Icon 1
  • 10.32657/10356/171848
Glaucoma detection based on optical coherence tomography imaging
  • Jan 1, 2023
  • Chi Li

The thesis evaluates the machine learning (ML) and deep learning (DL) approaches’ performance in accurately detecting glaucoma based on optical coherence tomography tabular data and images from individuals of different ethnicities. While numerous studies have employed ML and DL techniques for glaucoma identification, their performance has not been evaluated across diverse ethnic groups. In addition, a DL approach utilizing the Swin Transformer architecture trained on the thickness map images of the retinal nerve fiber layer (RNFL) was also evaluated. This Swin transformer DL model demonstrated an AUC of 0.97 in the internal testing dataset (Asian) and 0.88 in the external testing dataset (Caucasian). However, like the ML classifiers trained on measured data, the DL approach which was trained on raw thickness map images also exhibited poor reproducibility across different datasets. To address these issues, a cross-sectional study design was employed to investigate both ML and DL’s model performance in glaucoma detection using OCT data from individuals of different ethnicities. The study included 514 Asian participants, consisting of 257 with glaucoma and 257 controls, to develop ML and DL classifiers. The trained classifiers were subsequently evaluated on two separate participant groups comprising 356 Asians and 138 Caucasians. Two machine learning classifiers were created using the two types of RNFL thickness, one using the original values extracted from OCT machines (measured RNFL), and the other generated from the compensation model. The compensation model is a multivariate regression trained on normal individuals. It corrects the 12-clock RNFL thicknesses for multiple demographic and anatomical parameters. Additionally, a deep learning model was developed using the Swin Transformer architecture based on the measured RNFL thickness map images from OCT. Explainable artificial intelligence techniques (CAM and SHAP) were utilized to better interpret the results. Performance metrics such as the area under the receiver operating characteristic curve (AUC), accuracy and sensitivity were employed to examine the effectiveness of different glaucoma detection models. Both machine learning (AUC = 0.96) and deep learning (AUC = 0.97) models demonstrated superior performance compared to the raw measured data (baseline, AUC = 0.93), in the internal testing dataset (Asian). However, in the external testing dataset (Caucasian), ML models utilizing the compensated data (AUC = 0.93) exhibited significantly better performance compared to ML models using the original measured data (AUC = 0.83) and the baseline (AUC = 0.82). Furthermore, the ML and DL models trained on measured data exhibited inadequate generalization ability across different ethnicities, whereas the ML model using the compensated data maintained its performance in the external testing dataset. These findings caution against the indiscriminate application of ML and DL models to patient cohorts of different ethnicities. They also suggest that incorporating the compensation model into the development of ML models may enhance their performance in glaucoma detection across diverse ethnicities. Overall, the study highlights the importance of accounting for anatomical variations across different ethnic groups when developing machine-learning models for glaucoma detection using OCT data.

  • Research Article
  • 10.1097/pec.0000000000003572
Use of Machine Learning to Predict Hospital Admission for EMS-Treated Infants After a Suspected BRUE.
  • Feb 9, 2026
  • Pediatric emergency care
  • Jake Toy + 5 more

This study explored the use of different applied machine learning (ML) classification algorithms to predict hospital admission for infants treated by emergency medical services (EMS) after a suspected brief resolved unexplained event (BRUE). Data from a large regionalized pediatric care system were obtained for infants in which paramedic suspected a BRUE and who were transported between July 2017 and February 2021. After data pre-processing, a random 80%/20% split for training and testing was performed. First, a random forest ML classification model was used to identify and select the most important variables influencing the prediction of hospital admission. Then, multiple ML-based models and a statistical model were trained with this subset of variables and evaluated the performance of each to predict hospital admission. Model performance characteristics including the area under the receiver operator curve (AUROC) were reported. A total of 508 infants were included; 300 (59%) were admitted and 76 (15%) required critical care. The most important variables in predicting hospital admission were age, history of bystander interventions (ie, cardiopulmonary resuscitation and back blows), presence of past medical history, and a normal appearing examination. In the prediction of hospital admission, the support vector machine model achieved the highest AUROC of 0.85, with a sensitivity of 0.88 (95% CI: 0.80-0.96) and specificity of 0.71 (95% CI: 0.57-0.85). The predictive performance of the extreme gradient boosting, RF, and logistic regression models were similar (AUROC: 0.83 to 0.84). The applied ML models demonstrated good predictive performance for hospital admission for EMS-treated infants with a paramedic suspected BRUE. ML and statistical models had similar predictive performance.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 14
  • 10.1109/tse.2024.3350019
Test Input Prioritization for Machine Learning Classifiers
  • Mar 1, 2024
  • IEEE Transactions on Software Engineering
  • Xueqi Dang + 5 more

Machine learning has achieved remarkable success across diverse domains. Nevertheless, concerns about interpretability in black-box models, especially within Deep Neural Networks (DNNs), have become pronounced in safety-critical fields like healthcare and finance. Classical machine learning (ML) classifiers, known for their higher interpretability, are preferred in these domains. Similar to DNNs, classical ML classifiers can exhibit bugs that could lead to severe consequences in practice. Test input prioritization has emerged as a promising approach to ensure the quality of an ML system, which prioritizes potentially misclassified tests so that such tests can be identified earlier with limited manual labeling costs. However, when applying to classical ML classifiers, existing DNN test prioritization methods are constrained from three perspectives: 1) Coverage-based methods are inefficient and time-consuming; 2) Mutation-based methods cannot be adapted to classical ML models due to mismatched model mutation rules; 3) Confidence-based methods are restricted to a single dimension when applying to binary ML classifiers, solely depending on the model’s prediction probability for one class. To overcome the challenges, we propose MLPrior, a test prioritization approach specifically tailored for classical ML models. MLPrior leverages the characteristics of classical ML classifiers (i.e., interpretable models and carefully engineered attribute features) to prioritize test inputs. The foundational principles are: 1) tests more sensitive to mutations are more likely to be misclassified, and 2) tests closer to the model’s decision boundary are more likely to be misclassified. Building on the first concept, we design mutation rules to generate two types of mutation features (i.e., <bold xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">model mutation features</b> and <bold xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">input mutation features</b> ) for each test. Drawing from the second notion, MLPrior generates <bold xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">attribute features</b> of each test based on its attribute values, which can indirectly reveal the proximity between the test and the decision boundary. For each test, MLPrior combines all three types of features of it into a final vector. Subsequently, MLPrior employs a pre-trained ranking model to predict the misclassification probability of each test based on its final vector and ranks tests accordingly. We conducted an extensive study to evaluate MLPrior based on 185 subjects, encompassing natural datasets, mixed noisy datasets, and fairness datasets. The results demonstrate that MLPrior outperforms all the compared test prioritization approaches, with an average improvement of 14.74%∼66.93% on natural datasets, 18.55%∼67.73% on mixed noisy datasets, and 15.34%∼62.72% on fairness datasets.

  • Research Article
  • Cite Count Icon 8
  • 10.1186/s12905-025-03669-4
Development and evaluation of a machine learning model for osteoporosis risk prediction in Korean women
  • Mar 28, 2025
  • BMC Women's Health
  • Minkyung Je + 3 more

BackgroundThe aim of this study was to develop a machine learning (ML) model for classifying osteoporosis in Korean women based on a large-scale population cohort study. This study also aimed to assess ML model performance compared with traditional osteoporosis screening tools. Furthermore, this study aimed to examine the factors influencing the risk of osteoporosis through variable importance.MethodsData was collected from 4199 women aged 40–69 years in the baseline survey of the Ansan and Ansung cohort of the Korean Genome and Epidemiology Study. Osteoporosis was set as the dependent variable to develop ML classification models. Independent variables included 122 factors related to osteoporosis risk, such as socio-demographic characteristics, anthropometric parameters, lifestyle factors, reproductive factors, nutrient intakes, diet quality indices, medical history, medication history, family history, biochemical parameters, and genetic factors. The six classification models were developed using ML techniques, including decision tree, random forest, multilayer perceptron, support vector machine, light gradient boosting machine, and extreme gradient boosting (XGBoost). The six ML classification models were compared with two traditional osteoporosis screening tools, including the osteoporosis risk assessment instrument (ORAI) and the osteoporosis self-assessment tool (OST). The ML model performances were evaluated and compared using the confusion matrix and area under the curve (AUC) metrics. Variable importance was assessed using the XGBoost technique to investigate osteoporosis risk factors.ResultsThe XGBoost model showed the highest performance out of the six ML classification models, with an accuracy of 0.705, precision of 0.664, recall of 0.830, and F1 score of 0.738. Moreover, the XGBoost model showed a higher performance on AUC than ORAI and OST. Variable importance scores were identified for 69 out of the 122 variables associated with osteoporosis risk factors. Age at menopause ranked first in variable importance. Variables of arthritis, physical activities, hypertension, education level, income level; alcohol intake, potassium intake, homeostatic model assessment for insulin resistance; energy intake, vitamin C intake, gout; and dietary inflammatory index ranked in the top 20 out of the 69 variables, using the XGBoost technique.ConclusionsThis study found that an XGBoost model can be utilized to classify osteoporosis in Korean women. Age at menopause is a significant factor in osteoporosis risk, followed by arthritis, physical activities, hypertension, and education level.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 6
  • 10.1007/s10278-023-00957-z
MRI-Based Machine Learning Fusion Models to Distinguish Encephalitis and Gliomas
  • Jan 12, 2024
  • Journal of imaging informatics in medicine
  • Fei Zheng + 7 more

This paper aims to compare the performance of the classical machine learning (CML) model and the deep learning (DL) model, and to assess the effectiveness of utilizing fusion radiomics from both CML and DL in distinguishing encephalitis from glioma in atypical cases. We analysed the axial FLAIR images of preoperative MRI in 116 patients pathologically confirmed as gliomas and clinically diagnosed with encephalitis. The 3 CML models (logistic regression (LR), support vector machine (SVM) and multi-layer perceptron (MLP)), 3 DL models (DenseNet 121, ResNet 50 and ResNet 18) and a deep learning radiomic (DLR) model were established, respectively. The area under the receiver operating curve (AUC) and sensitivity, specificity, accuracy, negative predictive value (NPV) and positive predictive value (PPV) were calculated for the training and validation sets. In addition, a deep learning radiomic nomogram (DLRN) and a web calculator were designed as a tool to aid clinical decision-making. The best DL model (ResNet50) consistently outperformed the best CML model (LR). The DLR model had the best predictive performance, with AUC, sensitivity, specificity, accuracy, NPV and PPV of 0.879, 0.929, 0.800, 0.875, 0.867 and 0.889 in the validation sets, respectively. Calibration curve of DLR model shows good agreement between prediction and observation, and the decision curve analysis (DCA) indicated that the DLR model had higher overall net benefit than the other two models (ResNet50 and LR). Meanwhile, the DLRN and web calculator can provide dynamic assessments. Machine learning (ML) models have the potential to non-invasively differentiate between encephalitis and glioma in atypical cases. Furthermore, combining DL and CML techniques could enhance the performance of the ML models.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 19
  • 10.1371/journal.pone.0282608
A hybrid CNN and ensemble model for COVID-19 lung infection detection on chest CT scans.
  • Mar 9, 2023
  • PLOS ONE
  • Ahmed A Akl + 3 more

COVID-19 is highly infectious and causes acute respiratory disease. Machine learning (ML) and deep learning (DL) models are vital in detecting disease from computerized chest tomography (CT) scans. The DL models outperformed the ML models. For COVID-19 detection from CT scan images, DL models are used as end-to-end models. Thus, the performance of the model is evaluated for the quality of the extracted feature and classification accuracy. There are four contributions included in this work. First, this research is motivated by studying the quality of the extracted feature from the DL by feeding these extracted to an ML model. In other words, we proposed comparing the end-to-end DL model performance against the approach of using DL for feature extraction and ML for the classification of COVID-19 CT scan images. Second, we proposed studying the effect of fusing extracted features from image descriptors, e.g., Scale-Invariant Feature Transform (SIFT), with extracted features from DL models. Third, we proposed a new Convolutional Neural Network (CNN) to be trained from scratch and then compared to the deep transfer learning on the same classification problem. Finally, we studied the performance gap between classic ML models against ensemble learning models. The proposed framework is evaluated using a CT dataset, where the obtained results are evaluated using five different metrics The obtained results revealed that using the proposed CNN model is better than using the well-known DL model for the purpose of feature extraction. Moreover, using a DL model for feature extraction and an ML model for the classification task achieved better results in comparison to using an end-to-end DL model for detecting COVID-19 CT scan images. Of note, the accuracy rate of the former method improved by using ensemble learning models instead of the classic ML models. The proposed method achieved the best accuracy rate of 99.39%.

  • Research Article
  • Cite Count Icon 46
  • 10.1016/j.compag.2023.107723
Leaf area index estimation of pergola-trained vineyards in arid regions using classical and deep learning methods based on UAV-based RGB images
  • Mar 1, 2023
  • Computers and Electronics in Agriculture
  • Osman Ilniyaz + 7 more

Leaf area index estimation of pergola-trained vineyards in arid regions using classical and deep learning methods based on UAV-based RGB images

Save Icon
Up Arrow
Open/Close
Notes

Save Important notes in documents

Highlight text to save as a note, or write notes directly

You can also access these Documents in Paperpal, our AI writing tool

Powered by our AI Writing Assistant