Build Machine Learning Models Research Articles

Introduction:Advances in wearable sensor technology have enabled the collection of biomarkers that may correlate with levels of elevated stress. While significant research has been done in this domain, specifically in using machine learning to detect elevated levels of stress, the challenge of producing a machine learning model capable of generalizing well for use on new, unseen data remain. Acute stress response has both subjective, psychological and objectively measurable, biological components that can be expressed differently from person to person, further complicating the development of a generic stress measurement model. Another challenge is the lack of large, publicly available datasets labeled for stress response that can be used to develop robust machine learning models. In this paper, we first investigate the generalization ability of models built on datasets containing a small number of subjects, recorded in single study protocols. Next, we propose and evaluate methods combining these datasets into a single, large dataset to study the generalization capability of machine learning models built on larger datasets. Finally, we propose and evaluate the use of ensemble techniques by combining gradient boosting with an artificial neural network to measure predictive power on new, unseen data. In favor of reproducible research and to assist the community advance the field, we make all our experimental data and code publicly available through Github at https://github.com/xalentis/Stress. This paper’s in-depth study of machine learning model generalization for stress detection provides an important foundation for the further study of stress response measurement using sensor biomarkers, recorded with wearable technologies. Methods:Sensor biomarker data from six public datasets were utilized in this study. Exploratory data analysis was performed to understand the physiological variance between study subjects, and the complexity it introduces in building machine learning models capable of detecting elevated levels of stress on new, unseen data. To test model generalization, we developed a gradient boosting model trained on one dataset (SWELL), and tested its predictive power on two datasets previously used in other studies (WESAD, NEURO). Next, we merged four small datasets, i.e. (SWELL, NEURO, WESAD, UBFC-Phys), to provide a combined total of 99 subjects, and applied feature engineering to generate additional features utilizing statistical summaries, with sliding windows of 25 s. We name this large dataset, StressData. In addition, we utilized random sampling on StressData combined with another dataset (EXAM) to build a larger training dataset consisting of 200 synthesized subjects, which we name SynthesizedStressData. Finally, we developed an ensemble model that combines our gradient boosting model with an artificial neural network, and tested it using Leave-One-Subject-Out (LOSO) validation, and on two additional, unseen publicly available stress biomarker datasets (WESAD and Toadstool). Results:Our results show that previous models built on datasets containing a small number (<50) of subjects, recorded in single study protocols, cannot generalize well to new, unseen datasets. Our presented methodology for generating a large, synthesized training dataset by utilizing random sampling to construct scenarios closely aligned with experimental conditions demonstrate significant benefits. When combined with feature-engineering and ensemble learning, our method delivers a robust stress measurement system capable of achieving 85% predictive accuracy on new, unseen validation data, achieving a 25% performance improvement over single models trained on small datasets. The resulting model can be used as both a classification or regression predictor for estimating the level of perceived stress, when applied on specific sensor biomarkers recorded using a wearable device, while further allowing researchers to construct large, varied datasets for training machine learning models that closely emulate their exact experimental conditions. Conclusion:Models trained on small, single study protocol datasets do not generalize well for use on new, unseen data and lack statistical power. Machine learning models trained on a dataset containing a larger number of varied study subjects capture physiological variance better, resulting in more robust stress detection. Feature-engineering assists in capturing these physiological variance, and this is further improved by utilizing ensemble techniques by combining the predictive power of different machine learning models, each capable of learning unique signals contained within the data. While there is a general lack of large, labeled public datasets that can be utilized for training machine learning models capable of accurately measuring levels of acute stress, random sampling techniques can successfully be applied to construct larger, varied datasets from these smaller sample datasets, for building robust machine learning models.

Read full abstract

Since the World Health Organization declared COVID-19 a pandemic in 2020, the global community has faced ongoing challenges in controlling and mitigating the transmission of the SARS-CoV-2 virus, as well as its evolving subvariants and recombinants. A significant challenge during the pandemic has not only been the accurate detection of positive cases but also the efficient prediction of risks associated with complications and patient survival probabilities. These tasks entail considerable clinical resource allocation and attention. In this study, we introduce COVID-Net Biochem, a versatile and explainable framework for constructing machine learning models. We apply this framework to predict COVID-19 patient survival and the likelihood of developing Acute Kidney Injury during hospitalization, utilizing clinical and biochemical data in a transparent, systematic approach. The proposed approach advances machine learning model design by seamlessly integrating domain expertise with explainability tools, enabling model decisions to be based on key biomarkers. This fosters a more transparent and interpretable decision-making process made by machines specifically for medical applications. More specifically, the framework comprises two phases: In the first phase, referred to as the “clinician-guided design” phase, the dataset is preprocessed using explainable AI and domain expert input. To better demonstrate this phase, we prepared a benchmark dataset of carefully curated clinical and biochemical markers based on clinician assessments for survival and kidney injury prediction in COVID-19 patients. This dataset was selected from a patient cohort of 1366 individuals at Stony Brook University. Moreover, we designed and trained a diverse collection of machine learning models, encompassing gradient-based boosting tree architectures and deep transformer architectures, specifically for survival and kidney injury prediction based on the selected markers. In the second phase, called the “explainability-driven design refinement” phase, the proposed framework employs explainability methods to not only gain a deeper understanding of each model’s decision-making process but also to identify the overall impact of individual clinical and biochemical markers for bias identification. In this context, we used the models constructed in the previous phase for the prediction task and analyzed the explainability outcomes alongside a clinician with over 8 years of experience to gain a deeper understanding of the clinical validity of the decisions made. The explainability-driven insights obtained, in conjunction with the associated clinical feedback, are then utilized to guide and refine the training policies and architectural design iteratively. This process aims to enhance not only the prediction performance but also the clinical validity and trustworthiness of the final machine learning models. Employing the proposed explainability-driven framework, we attained 93.55% accuracy in survival prediction and 88.05% accuracy in predicting kidney injury complications. The models have been made available through an open-source platform. Although not a production-ready solution, this study aims to serve as a catalyst for clinical scientists, machine learning researchers, and citizen scientists to develop innovative and trustworthy clinical decision support solutions, ultimately assisting clinicians worldwide in managing pandemic outcomes.

Read full abstract

Build Machine Learning Models Research Articles

Related Topics

Articles published on Build Machine Learning Models

Predicting Binding Affinity Between MHC-I Receptor and Peptides Based on Molecular Docking and Protein-peptide Interaction Interface Characteristics

Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices

Comprehensive Decision-Making Guide Predicting Colleges Based on User Profile Using Ensemble Machine Learning Model

Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach

NIMG-39. BENCHMARKING MULTI-LABEL PREDICTION OF TARGET GENE MUTATION STATUS IN GLIOBLASTOMA USING RADIOGENOMICS AND MULTI-PARAMETRIC MR IMAGING

High-resolution magnetic resonance imaging-based radiomic features aid in selecting endovascular candidates among patients with cerebral venous sinus thrombosis

A knowledge graph-based data harmonization framework for secondary data reuse

Revealing the impact of TOX3 on osteoarthritis: insights from bioinformatics.

A Data-centric AI Framework for Automating Exploratory Data Analysis and Data Quality Tasks

Depression and anxiety have distinct and overlapping language patterns: Results from a clinical interview.

A machine learning approach to graduate admissions and the role of letters of recommendation.

Pay Later Risk Management: A Review of FMECA and Potential Customer Prediction Frameworks Through the Application of Machine Learning

Predicting rock hardness using Gaussian weighted moving average filter on borehole data and machine learning

Understanding Partnership Formation and Repeated Contributions in Federated Learning: An Analytical Investigation

Prediction of Surgical Upstaging Risk of Ductal Carcinoma In Situ Using Machine Learning Models.

Predicting open education competency level: A machine learning approach

SportsTables: A New Corpus for Semantic Type Detection (Extended Version)

A Comparative Investigation of Random Forest Regression and Artificial Neural Networks for Predicting Crack Growth Life of a Fighter Aircraft Wing Joint Under Spectrum Loading

COVID-Net Biochem: an explainability-driven framework to building machine learning models for predicting survival and kidney injury of COVID-19 patients from clinical and biochemistry data

Local Industrialization Based Lucrative Farming Using Machine Learning Technique

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Build Machine Learning Models Research Articles

Related Topics

Articles published on Build Machine Learning Models

Predicting Binding Affinity Between MHC-I Receptor and Peptides Based on Molecular Docking and Protein-peptide Interaction Interface Characteristics

Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices

Comprehensive Decision-Making Guide Predicting Colleges Based on User Profile Using Ensemble Machine Learning Model

Addressing the data bottleneck in medical deep learning models using a human-in-the-loop machine learning approach

NIMG-39. BENCHMARKING MULTI-LABEL PREDICTION OF TARGET GENE MUTATION STATUS IN GLIOBLASTOMA USING RADIOGENOMICS AND MULTI-PARAMETRIC MR IMAGING

High-resolution magnetic resonance imaging-based radiomic features aid in selecting endovascular candidates among patients with cerebral venous sinus thrombosis

A knowledge graph-based data harmonization framework for secondary data reuse

Revealing the impact of TOX3 on osteoarthritis: insights from bioinformatics.

A Data-centric AI Framework for Automating Exploratory Data Analysis and Data Quality Tasks

Depression and anxiety have distinct and overlapping language patterns: Results from a clinical interview.

A machine learning approach to graduate admissions and the role of letters of recommendation.

Pay Later Risk Management: A Review of FMECA and Potential Customer Prediction Frameworks Through the Application of Machine Learning

Predicting rock hardness using Gaussian weighted moving average filter on borehole data and machine learning

Understanding Partnership Formation and Repeated Contributions in Federated Learning: An Analytical Investigation

Prediction of Surgical Upstaging Risk of Ductal Carcinoma In Situ Using Machine Learning Models.

Predicting open education competency level: A machine learning approach

SportsTables: A New Corpus for Semantic Type Detection (Extended Version)

A Comparative Investigation of Random Forest Regression and Artificial Neural Networks for Predicting Crack Growth Life of a Fighter Aircraft Wing Joint Under Spectrum Loading

COVID-Net Biochem: an explainability-driven framework to building machine learning models for predicting survival and kidney injury of COVID-19 patients from clinical and biochemistry data

Local Industrialization Based Lucrative Farming Using Machine Learning Technique