Applications of Machine Learning in Assessing Cognitive Load of Uncrewed Aerial System Operators and in Enhancing Training: A Systematic Review
This research is based on a systematic review of machine learning (ML) approaches for the cognitive load (CL) assessment of applications for unmanned aerial system (UAS) operator training. The review synthesises evidence on how ML techniques have been applied to assess CL using diverse data sources, including physiological signals (e.g., EEG, HRV), behavioural measures (e.g., eye-tracking), and performance indicators. It highlights the effectiveness of models such as Support Vector Machines (SVMs), Random Forests (RFs), and advanced deep learning (DL) architectures such as Long Short-Term Memory (LSTM), as well as how the use of different methods affects the performance of ML models, with studies reporting accuracies of up to 98%. The findings also indicate that, compared with traditional UAS training approaches, ML approaches can enhance training by providing adaptive assessment, with methodological factors such as model selection, data preprocessing, and validation being central to ML assessment performance. These findings highlight the value of accurate CL assessment as a foundation for adaptive training systems, supporting enhanced UAS operator performance and operational safety. By consolidating the methodological insights and identifying research gaps, this review provides valuable background information for advancing ML-based CL assessment and its integration into adaptive UAS operator training systems to enhance UAS operator training.
- Preprint Article
1
- 10.5194/egusphere-egu2020-690
- Jul 17, 2020
<p>The advancement of big data and increased computational power have contributed to an increased use of Machine Learning (ML) approaches in hydrological modelling. These approaches are powerful tools for modeling non-linear systems. However, the applicability of ML in non-stationary conditions needs to be studied further. As climate change will change hydrological patterns, testing ML approaches for non-stationary conditions is essential. Here, we used the Differential Split-Sample Test (DSST) to test the climate transposability of ML approaches (e.g., calibrating in a wet period and validating in a dry one, and vice-versa).  We applied five ML approaches using daily precipitation and temperature as input for the prediction of the daily discharge in six snow-dominated Swiss catchments. Lower and upper benchmarks were used to evaluate performances through a relative performance measure. The lower benchmark is the average of the bucket-type HBV model runs from 1000 random parameter sets. The upper benchmark is the automatically calibrated HBV model. In comparison with the stationary condition, the models performed slightly poorer in the non-stationary condition. The performance of simple ML approaches was poor for non-stationary conditions with an underestimation of peak flows, as well as a poor representation of the snow-melting period. On the other hand, a more complex ML approach (deep learning), the Long Short -Term Memory (LSTM), showed a good performance when compared with the lower and upper benchmarks. This might be explained by the fact that the so-called memory cell allowed to simulate the storage effects. </p>
- Research Article
36
- 10.1097/corr.0000000000001360
- Jul 30, 2020
- Clinical Orthopaedics & Related Research
Machine learning (ML) is a subdomain of artificial intelligence that enables computers to abstract patterns from data without explicit programming. A myriad of impactful ML applications already exists in orthopaedics ranging from predicting infections after surgery to diagnostic imaging. However, no systematic reviews that we know of have compared, in particular, the performance of ML models with that of clinicians in musculoskeletal imaging to provide an up-to-date summary regarding the extent of applying ML to imaging diagnoses. By doing so, this review delves into where current ML developments stand in aiding orthopaedists in assessing musculoskeletal images. This systematic review aimed (1) to compare performance of ML models versus clinicians in detecting, differentiating, or classifying orthopaedic abnormalities on imaging by (A) accuracy, sensitivity, and specificity, (B) input features (for example, plain radiographs, MRI scans, ultrasound), (C) clinician specialties, and (2) to compare the performance of clinician-aided versus unaided ML models. A systematic review was performed in PubMed, Embase, and the Cochrane Library for studies published up to October 1, 2019, using synonyms for machine learning and all potential orthopaedic specialties. We included all studies that compared ML models head-to-head against clinicians in the binary detection of abnormalities in musculoskeletal images. After screening 6531 studies, we ultimately included 12 studies. We conducted quality assessment using the Methodological Index for Non-randomized Studies (MINORS) checklist. All 12 studies were of comparable quality, and they all clearly included six of the eight critical appraisal items (study aim, input feature, ground truth, ML versus human comparison, performance metric, and ML model description). This justified summarizing the findings in a quantitative form by calculating the median absolute improvement of the ML models compared with clinicians for the following metrics of performance: accuracy, sensitivity, and specificity. ML models provided, in aggregate, only very slight improvements in diagnostic accuracy and sensitivity compared with clinicians working alone and were on par in specificity (3% (interquartile range [IQR] -2.0% to 7.5%), 0.06% (IQR -0.03 to 0.14), and 0.00 (IQR -0.048 to 0.048), respectively). Inputs used by the ML models were plain radiographs (n = 8), MRI scans (n = 3), and ultrasound examinations (n = 1). Overall, ML models outperformed clinicians more when interpreting plain radiographs than when interpreting MRIs (17 of 34 and 3 of 16 performance comparisons, respectively). Orthopaedists and radiologists performed similarly to ML models, while ML models mostly outperformed other clinicians (outperformance in 7 of 19, 7 of 23, and 6 of 10 performance comparisons, respectively). Two studies evaluated the performance of clinicians aided and unaided by ML models; both demonstrated considerable improvements in ML-aided clinician performance by reporting a 47% decrease of misinterpretation rate (95% confidence interval [CI] 37 to 54; p < 0.001) and a mean increase in specificity of 0.048 (95% CI 0.029 to 0.068; p < 0.001) in detecting abnormalities on musculoskeletal images. At present, ML models have comparable performance to clinicians in assessing musculoskeletal images. ML models may enhance the performance of clinicians as a technical supplement rather than as a replacement for clinical intelligence. Future ML-related studies should emphasize how ML models can complement clinicians, instead of determining the overall superiority of one versus the other. This can be accomplished by improving transparent reporting, diminishing bias, determining the feasibility of implantation in the clinical setting, and appropriately tempering conclusions. Level III, diagnostic study.
- Research Article
44
- 10.1002/aps3.11371
- Jun 1, 2020
- Applications in Plant Sciences
Plants meet machines: Prospects in machine learning for plant biology
- Research Article
42
- 10.1016/j.cscm.2023.e02723
- Nov 29, 2023
- Case Studies in Construction Materials
Ultra-high-performance concrete (UHPC) is a sustainable construction material; it can be applied as a substitute for cement concrete. Artificial intelligence methods have been used to evaluate concrete composites to reduce time and money in the construction industries. So, this study applied machine learning (ML) and hybrid ML approaches to predict the compressive and flexural strength of UHPC. A dataset of 626 compressive strength and 317 flexural strength data points was collected from the published research articles, where fourteen important variables were selected as input parameters for the analysis of hybrid ML and ML algorithms. This research used XGBoost, LightGBM, and hybrid XGBoost- LightGBM algorithms to predict UHPC materials. Grid search (GS) techniques were used to adjust model hyper-parameters in search of improved high accuracy and efficiency. ML and hybrid ML models were train, and the test stage utilized statistical assessments such as coefficient of determination (R-square), root mean square error (RMSE), mean absolute error (MAE), and coefficient of efficiency (CE). The results presented hybrid ML algorithm was superior to the XGBoost and LightGBM algorithms in terms of R-square and RMSE values for both compressive and flexural strength prediction. A hybrid ML model and two ML models showed CS considerable R-square values above 0.94 at the testing stages and just over 0.97 at the training phase. Hybrid ML model performance accuracy for CS prediction R-square value found that almost 0.996 for training and 0.963 for testing phases. At the same time, the FS prediction result showed that the R-square value of the Hybrid ML model and two traditional ML models were found at almost 0.95 for the training phase and around 0.81 for the testing phase. But among them, the hybrid XGB-LGB model prediction performance was high accuracy and lowest error for CS and FS of UHPC trained and its hyperparameters optimized. Additionally, the SHAP investigation reveals the impact and relationship of the input variables with the output variables. SHAP analysis outcome reveals that curing age and steel fiber content input parameter had the highest positive impact on compressive strength and flexural strength of UHPC.
- Research Article
57
- 10.1016/j.trechm.2020.10.007
- Nov 9, 2020
- Trends in Chemistry
Chemical heuristics have been fundamental to the advancement of chemistry and materials science. These heuristics are typically established by scientists using knowledge and creativity to extract patterns from limited datasets. Machine learning offers opportunities to perfect this approach using computers and larger datasets. Here, we discuss the relationships between traditional heuristics and machine learning approaches. We show how traditional rules can be challenged by large-scale statistical assessment and how traditional concepts commonly used as features are feeding the machine learning techniques. We stress the waste involved in relearning chemical rules and the challenges in terms of data size requirements for purely data-driven approaches. Our view is that heuristic and machine learning approaches are at their best when they work together.
- Research Article
- 10.1093/bjs/znaf042.035
- Mar 12, 2025
- British Journal of Surgery
Background Peripheral Arterial Disease(PAD) is a prevalent cardiovascular condition affecting millions worldwide, and the advent of machine learning(ML) techniques offers promising opportunities to enhance risk prediction and personalise patient management in this complex disease. This systematic review and meta-analysis aimed to evaluate the performance of ML models in predicting adverse outcomes in PAD patients compared to traditional statistical approaches. Methods A comprehensive search of major databases(MEDLINE,EMBASE,CINAHL,Cochrane CENTRAL) was conducted for studies published between 2000 and 2024. Two reviewers independently screened studies, extracted data, and assessed risk of bias using the PROBAST tool. The quality of included studies was also assessed using criteria outlined by Qiao et al. A modified Hierarchical Summary Receiver Operating Characteristic(HSROC) analysis was performed to compare the predictive performance of various ML models and traditional regression methods. Results Thirteen studies met the inclusion criteria. Gradient boosted models demonstrated the highest predictive performance with a diagnostic odds ratio(DOR) of 36.593(95% CI:24.44-54.79), sensitivity of 0.853, and specificity of 0.863. The best performing ML models for outcomes within 2 years, showed a DOR of 34.169(95% CI:22.840-51.117). Traditional regression models consistently underperformed compared to ML approaches, with the lowest DOR of 3.326(95% CI:1.814-6.097). The quality assessment revealed a mix of methodological rigor, with 54% of studies rated as low risk of bias and 46% as unclear. Conclusion Advanced ML techniques demonstrate superior predictive power for adverse outcomes in PAD patients compared to traditional regression methods.
- Research Article
2
- 10.1108/jd-05-2022-0096
- Apr 26, 2024
- Journal of Documentation
PurposeThis paper provides practical advice for archaeologists and heritage specialists wishing to use ML approaches to identify archaeological features in high-resolution satellite imagery (or other remotely sensed data sources). We seek to balance the disproportionately optimistic literature related to the application of ML to archaeological prospection through a discussion of limitations, challenges and other difficulties. We further seek to raise awareness among researchers of the time, effort, expertise and resources necessary to implement ML successfully, so that they can make an informed choice between ML and manual inspection approaches.Design/methodology/approachAutomated object detection has been the holy grail of archaeological remote sensing for the last two decades. Machine learning (ML) models have proven able to detect uniform features across a consistent background, but more variegated imagery remains a challenge. We set out to detect burial mounds in satellite imagery from a diverse landscape in Central Bulgaria using a pre-trained Convolutional Neural Network (CNN) plus additional but low-touch training to improve performance. Training was accomplished using MOUND/NOT MOUND cutouts, and the model assessed arbitrary tiles of the same size from the image. Results were assessed using field data.FindingsValidation of results against field data showed that self-reported success rates were misleadingly high, and that the model was misidentifying most features. Setting an identification threshold at 60% probability, and noting that we used an approach where the CNN assessed tiles of a fixed size, tile-based false negative rates were 95–96%, false positive rates were 87–95% of tagged tiles, while true positives were only 5–13%. Counterintuitively, the model provided with training data selected for highly visible mounds (rather than all mounds) performed worse. Development of the model, meanwhile, required approximately 135 person-hours of work.Research limitations/implicationsOur attempt to deploy a pre-trained CNN demonstrates the limitations of this approach when it is used to detect varied features of different sizes within a heterogeneous landscape that contains confounding natural and modern features, such as roads, forests and field boundaries. The model has detected incidental features rather than the mounds themselves, making external validation with field data an essential part of CNN workflows. Correcting the model would require refining the training data as well as adopting different approaches to model choice and execution, raising the computational requirements beyond the level of most cultural heritage practitioners.Practical implicationsImproving the pre-trained model’s performance would require considerable time and resources, on top of the time already invested. The degree of manual intervention required – particularly around the subsetting and annotation of training data – is so significant that it raises the question of whether it would be more efficient to identify all of the mounds manually, either through brute-force inspection by experts or by crowdsourcing the analysis to trained – or even untrained – volunteers. Researchers and heritage specialists seeking efficient methods for extracting features from remotely sensed data should weigh the costs and benefits of ML versus manual approaches carefully.Social implicationsOur literature review indicates that use of artificial intelligence (AI) and ML approaches to archaeological prospection have grown exponentially in the past decade, approaching adoption levels associated with “crossing the chasm” from innovators and early adopters to the majority of researchers. The literature itself, however, is overwhelmingly positive, reflecting some combination of publication bias and a rhetoric of unconditional success. This paper presents the failure of a good-faith attempt to utilise these approaches as a counterbalance and cautionary tale to potential adopters of the technology. Early-majority adopters may find ML difficult to implement effectively in real-life scenarios.Originality/valueUnlike many high-profile reports from well-funded projects, our paper represents a serious but modestly resourced attempt to apply an ML approach to archaeological remote sensing, using techniques like transfer learning that are promoted as solutions to time and cost problems associated with, e.g. annotating and manipulating training data. While the majority of articles uncritically promote ML, or only discuss how challenges were overcome, our paper investigates how – despite reasonable self-reported scores – the model failed to locate the target features when compared to field data. We also present time, expertise and resourcing requirements, a rarity in ML-for-archaeology publications.
- Research Article
5
- 10.1128/spectrum.01703-23
- Oct 31, 2023
- Microbiology Spectrum
Antimicrobial resistance (AMR) in Neisseria gonorrhoeae is an urgent global health issue. Machine learning (ML) is a powerful tool that can aid in identifying mutations and predicting their impact on AMR. The study aimed to use ML models to predict ceftriaxone susceptibility and decreased susceptibility (S/DS). A public database of N. gonorrhoeae genomes with minimum inhibitory concentration (MIC) data was used to evaluate seven ML models using 97 single nucleotide polymorphisms (SNPs) known to be associated with ceftriaxone resistance. Ceftriaxone MICs ≤ 0.064 mg/L were classified as susceptible, and ceftriaxoneMICs > 0.064 mg/L were classified as DS. The contributions of individual SNPs to predict S/DS were calculated using SHapley Additive exPlanation (SHAP) values. An ML model was retrained using different combinations of SNPs with the highest SHAP values. The performance of ML models was assessed using different metrics including area under the curve (AUC) and balanced accuracy (bAcc). The ML analyses included 9,540 N. gonorrhoeae genomes; 368 (0.04%) were classified as DS. Of the models evaluated, the model trained with a random forest classifier had the highest performance (AUC 0.965; bAcc 0.926). A model retrained the top five SNPs, according to SHAP values, demonstrated a similar performance (AUC 0.916; bAcc 0.879) as the model with 97 SNPs. An ML approach using mutations in N. gonorrhoeae can be used to predict S/DS to ceftriaxone. The results highlight a practical application of ML to identify mutations most associated with S/DS to ceftriaxone, which can aid in the development of assays to predict AMR. IMPORTANCE Antimicrobial resistance in Neisseria gonorrhoeae is an urgent global health issue. The objectives of the study were to use a global collection of 12,936 N. gonorrhoeae genomes from the PathogenWatch database to evaluate different machine learning models to predict ceftriaxone susceptibility/decreased susceptibility using 97 mutations known to be associated with ceftriaxone resistance. We found the random forest classifier model had the highest performance. The analysis also reported the relative contributions of different mutations within the ML model predictions, allowing for the identification of the mutations with the highest importance for ceftriaxone resistance. A machine learning model retrained with the top five mutations performed similarly to the model using all 97 mutations. These results could aid in the development of molecular tests to detect resistance to ceftriaxone in N. gonorrhoeae. Moreover, this approach could be applied to building and evaluating machine learning models for predicting antimicrobial resistance in other pathogens.
- Research Article
- 10.46647/ijetms.2022.v06i05.126
- Sep 28, 2022
- international journal of engineering technology and management sciences
Due to the excellent performance of ML models when using complicated big data, the rise of machine learning (ML) applications has significantly accelerated the deployment of personalised medicine techniques for better health care over the past ten years. Precision medicine applications in clinical research and practise show significant promise when new ML approaches are used to the clinical investigation of chronic inflammatory illnesses. In this study, we emphasise the clinical uses of different ML approaches for prognosis, diagnosis, and prediction. Big data and ML algorithms can be used to identify the precise medicine for each individual based on their clinical, laboratory, nutrition and lifestyle-related data.The main objective of this invention is about creating an application that suggest the precision medicine to the patients. This application will really helpful is finding a best medicine to the patients based on their body condition and their genomics. This application will reduce the risk of trial and error by finding the best medicine for every individual.
- Research Article
2
- 10.1159/000538639
- Apr 22, 2024
- Cardiology
Introduction: Heart failure (HF) is a major global public health concern. The application of machine learning (ML) to identify individuals at high risk and enable early intervention is a promising approach for improving HF prognosis. We aim to systematically evaluate the performance and value of ML models for predicting HF prognosis. Methods: PubMed, Web of Science, Scopus, and Embase online databases were searched up to April 30, 2023, to identify studies on the use of ML models to predict HF prognosis. HF prognosis primarily encompasses readmission and mortality. The meta-analysis was conducted by MedCalc software. Subgroup analyses include grouping based on types of ML models, time intervals, sample sizes, the number of predictive variables, validation methods, whether to conduct hyperparameter optimization and calibration, data set partitioning methods. Results: A total of 31 studies were included. The most common ML models were random forest, boosting, support vector machine, neural network. The area under the receiver operating characteristic curve (AUC) for predicting HF readmission was 0.675 (95% CI: 0.651–0.699, p < 0.001), and the AUC for predicting HF mortality was 0.790 (95% CI: 0.765–0.816, p < 0.001). Subgroup analyses revealed that models with the prediction time interval of 1 year, sample sizes ≥10,000, the number of predictive variables ≥100, external validation, hyperparameter tuning, calibration adjustment, and data set partitioning using 10-fold cross-validation exhibited favorable performance within their respective subgroups. Conclusion: The performance of ML models in predicting HF readmission is relatively poor, while its performance in predicting HF mortality is moderate. The quality of the relevant studies is generally low, it is essential to enhance the predictive capabilities of ML models through targeted improvements in practical applications.
- Research Article
31
- 10.1213/ane.0000000000004656
- Jun 1, 2020
- Anesthesia & Analgesia
Machine-Learning Implementation in Clinical Anesthesia: Opportunities and Challenges.
- Research Article
- 10.37934/araset.32.2.404416
- Sep 7, 2023
- Journal of Advanced Research in Applied Sciences and Engineering Technology
Coronavirus disease (COVID-19) has become a serious worldwide health concern affecting the respiratory system since December 2019. Computed Tomography (CT) image analysis and identification are powerful tools for diagnosing COVID-19. However, due to the disparity of the distribution and form of COVID-19 infection and the diverse degrees of infection severity, the classification of the CT images is challenging, especially manually. Therefore, this study aims to employ artificial intelligence techniques, including machine learning and deep learning algorithms, to classify COVID-19 from CT images. The grey-level co-occurrence matrix (GLCM) features were computed and fed into machine learning classifiers, namely Support Vector Machine, Random Forest, K-Nearest Neighbour, Logistic Regression, and Naïve Bayes model for training purposes. Deep learning models, including ResNet50, Densenet121, Inception, and VGG16, were trained using raw data scaled and transformed to greyscale mode. The performances of machine learning and deep learning models were assessed on the testing data. Random Forests, ResNet50, and DenseNet121 outperform all the other models by achieving 100% accuracy, precision, sensitivity, and specificity on the dataset applied. The performance of machine learning models can be further improved by obtaining the optimised parameters in future research.
- Research Article
10
- 10.1124/jpet.122.001551
- Aug 31, 2023
- The Journal of pharmacology and experimental therapeutics
As pharmaceutical development moves from early-stage in vitro experimentation to later in vivo and subsequent clinical trials, data and knowledge are acquired across multiple time and length scales, from the subcellular to whole patient cohort scale. Realizing the potential of this data for informing decision making in pharmaceutical development requires the individual and combined application of machine learning (ML) and mechanistic multiscale mathematical modeling approaches. Here we outline how these two approaches, both individually and in tandem, can be applied at different stages of the drug discovery and development pipeline to inform decision making compound development. The importance of discerning between knowledge and data are highlighted in informing the initial use of ML or mechanistic quantitative systems pharmacology (QSP) models. We discuss the application of sensitivity and structural identifiability analyses of QSP models in informing future experimental studies to which ML may be applied, as well as how ML approaches can be used to inform mechanistic model development. Relevant literature studies are highlighted and we close by discussing caveats regarding the application of each approach in an age of constant data acquisition. SIGNIFICANCE STATEMENT: We consider when best to apply machine learning (ML) and mechanistic quantitative systems pharmacology (QSP) approaches in the context of the drug discovery and development pipeline. We discuss the importance of prior knowledge and data available for the system of interest and how this informs the individual and combined application of ML and QSP approaches at each stage of the pipeline.
- Research Article
35
- 10.1186/s40537-022-00676-2
- Jan 28, 2023
- Journal of Big Data
BackgroundWhen you make a forex transaction, you sell one currency and buy another. If the currency you buy increases against the currency you sell, you profit, and you do this through a broker as a retail trader on the internet using a platform known as meta trader. Only 2% of retail traders can successfully predict currency movement in the forex market, making it one of the most challenging tasks. Machine learning and its derivatives or hybrid models are becoming increasingly popular in market forecasting, which is a rapidly developing field.ObjectiveWhile the research community has looked into the methodologies used by researchers to forecast the forex market, there is still a need to look into how machine learning and artificial intelligence approaches have been used to predict the forex market and whether there are any areas that can be improved to allow for better predictions. Our objective is to give an overview of machine learning models and their application in the FX market.MethodThis study provides a Systematic Literature Review (SLR) of machine learning algorithms for FX market forecasting. Our research looks at publications that were published between 2010 and 2021. A total of 60 papers are taken into consideration. We looked at them from two angles: I the design of the evaluation techniques, and (ii) a meta-analysis of the performance of machine learning models utilizing evaluation metrics thus far.ResultsThe results of the analysis suggest that the most commonly utilized assessment metrics are MAE, RMSE, MAPE, and MSE, with EURUSD being the most traded pair on the planet. LSTM and Artificial Neural Network are the most commonly used machine learning algorithms for FX market prediction. The findings also point to many unresolved concerns and difficulties that the scientific community should address in the future.ConclusionBased on our findings, we believe that machine learning approaches in the area of currency prediction still have room for development. Researchers interested in creating more advanced strategies might use the open concerns raised in this work as input.
- Research Article
11
- 10.1007/s11837-020-04423-x
- Nov 10, 2020
- JOM
The development of machine learning (ML) approaches in materials science offers the opportunity to exploit existing engineering and developmental alloy datasets, such as Oak Ridge National Laboratory (ORNL)’s consistently measured creep-rupture dataset for alumina-forming austenitic (AFA) alloys, to accelerate their further development. As a first step toward achieving ML insights for improved alloy design, the potential sources of uncertainty and their impacts on ML output are examined. It is observed that the selection of algorithms and features as well as data sampling significantly affects the performance of ML models, either positively or negatively. The performance of various ML models in predicting the creep properties of AFA alloys is compared, with further evaluation by assessment of a small set of new developmental AFA alloys that were not part of the training dataset. The present study demonstrates that uncertainty quantification (UQ) is essential in materials science for evaluating the performance of ML algorithms with specifically selected feature sets and obtaining a comprehensive understanding of their limitations and the resultant capability of effective prediction in complex materials systems.
- Ask R Discovery
- Chat PDF
AI summaries and top papers from 250M+ research sources.