K-fold Cross-validation Approach Research Articles

Background: Chronic obstructive pulmonary disease (COPD) is a severe condition affecting millions worldwide, leading to numerous annual deaths. The absence of significant symptoms in its early stages leads to high underdiagnosis rates among affected people. Besides pulmonary function failure, another harmful aspect of COPD is its systemic effects, e.g., heart failure or voice distortion. However, symptoms caused by these systemic effects might provide valuable information for detecting the condition in its early stages.
Objective: The proposed study aims to explore whether the voice features extracted from the vowel “a” utterance carry any information that can be predictive of COPD by employing Machine Learning (ML) on a newly collected voice dataset.
Methods: Forty-eight participants were recruited from the pool of research clinic visitors at Blekinge Institute of Technology (BTH) in Sweden between January 2022 and May 2023. A dataset consisting of 1246 recordings from these 48 participants was gathered. The collection of voice recordings containing the vowel “a” utterance commenced following an information and consent meeting with each participant, using the VoiceDiagnostic application. The collected voice data was subjected to silence segment removal and to extraction of baseline acoustic features and Mel Frequency Cepstrum Coefficients (MFCC). Sociodemographic data was also collected from the participants. Three ML models were investigated for the binary classification of COPD and healthy controls: Random Forest (RF), Support Vector Machine (SVM), and CatBoost (CB). A nested k-fold cross-validation approach was employed, and the hyperparameters of each ML model were optimized using grid search. To assess performance, accuracy, F1-score, precision, and recall metrics were computed. Afterward, we further examined the best classifier using the Area Under the Curve (AUC), Average Precision (AP), and SHapley Additive exPlanations (SHAP) feature-importance measures.
Results: The classifiers RF, SVM, and CB achieved a maximum accuracy of 77%, 69%, and 78% on the test set and 93%, 78%, and 97% on the validation set, respectively. The CB classifier outperformed RF and SVM. On further investigation, CB, the best-performing classifier, produced an AUC of 82% and an AP of 76%. In addition to age and gender, the mean values of baseline acoustic and MFCC features demonstrate high importance and deterministic characteristics for classification performance in both test and validation sets, though in varied order.
Conclusion: This study concludes that recordings of the vowel “a” utterance contain information that can be captured by the CatBoost classifier with high accuracy for the classification of COPD. Additionally, baseline acoustic and MFCC features, in conjunction with age and gender information, can be employed for classification purposes and benefit healthcare for decision support in COPD diagnosis.
Clinical trial registration number: NCT05897944.
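The nested k-fold procedure with grid search described in the Methods can be sketched as follows. This is a minimal illustration in plain Python, not the authors' pipeline: the one-dimensional nearest-neighbour classifier, the synthetic two-class data, the fold counts, and the hyperparameter grid are all assumptions chosen to keep the example short. The point is structural: the inner loop selects a hyperparameter using only the outer-training folds, and the outer loop scores that choice on data the selection never saw.

```python
import random
from statistics import mean


def k_fold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def knn_predict(train, x, n_neighbors):
    """Majority vote among the n_neighbors training points closest to x."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:n_neighbors]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > len(nearest) else 0


def accuracy(train, test, n_neighbors):
    """Fraction of test points classified correctly."""
    hits = sum(knn_predict(train, x, n_neighbors) == y for x, y in test)
    return hits / len(test)


def split(data, fold):
    """Return (train, test) where test holds the points indexed by fold."""
    held = set(fold)
    test = [data[j] for j in fold]
    train = [data[j] for j in range(len(data)) if j not in held]
    return train, test


def cross_val_accuracy(data, k, n_neighbors, seed):
    """Plain k-fold cross-validation: mean accuracy over the k folds."""
    folds = k_fold_indices(len(data), k, seed)
    return mean(accuracy(*split(data, f), n_neighbors) for f in folds)


def nested_cv(data, outer_k=5, inner_k=3, grid=(1, 3, 5)):
    """Outer loop estimates performance; inner loop runs the grid search."""
    outer_scores = []
    for fold in k_fold_indices(len(data), outer_k, seed=0):
        outer_train, outer_test = split(data, fold)
        # Grid search sees only the outer-training portion.
        best = max(
            grid,
            key=lambda g: cross_val_accuracy(outer_train, inner_k, g, seed=1),
        )
        outer_scores.append(accuracy(outer_train, outer_test, best))
    return outer_scores


# Synthetic data (assumption): one feature, two overlapping classes.
rng = random.Random(42)
data = [(rng.gauss(0.0, 1.0), 0) for _ in range(30)]
data += [(rng.gauss(2.0, 1.0), 1) for _ in range(30)]

scores = nested_cv(data)
print("outer-fold accuracies:", scores)
print("mean accuracy:", mean(scores))
```

Because the hyperparameter is re-selected inside every outer fold, the mean of the outer scores estimates the performance of the whole tuning procedure rather than of one fixed setting.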


The compressive strength (CS) of concrete is one of the most important factors in the construction industry, and measuring it requires time- and effort-consuming tasks. To tackle such problems, machine learning (ML), a branch of artificial intelligence, has recently transformed the construction sector, increasing efficiency, accuracy, and creativity. Taking these factors into consideration, the current research was conducted on concrete manufactured with recycled coarse aggregate (RCA) and fly ash, byproducts of construction and demolition activities and of thermal power plants, respectively. A large dataset consisting of 444 data points, along with ten input parameters, has been collected from the literature to forecast the CS of fly ash and RCA-based self-compacting concrete (SCC). In this regard, ten advanced ML models, including K-Nearest Neighbors (KNN), Extra Tree Regressor (ETR), Bagging Regressor (BR), Adaboost Regressor (AR), Extreme Gradient Boosting (XGB), Linear Regression (LR), Random Forest (RF), Decision Tree Regression (DTR), Support Vector Regression (SVR), and Gradient Boosting Regression (GBR), have been considered. Furthermore, various data visualization plots and model performance metrics, such as scatter plots, histograms, heatmaps, Shapley Additive Explanations (SHAP) analysis, Regression Error Characteristic (REC) curves, and error measures, have been utilized. Sensitivity analysis is used to identify the most influential input parameter, and Taylor’s diagram to depict the overall performance of the ML models. As a method of validation, the k-fold cross-validation approach has been implemented to justify the obtained output. Based on the outcome of the study, the BR model displayed remarkable accuracy, with insignificant errors and a high R-squared value (R² = 0.961), while the XGB (R² = 0.959) and DTR (R² = 0.952) models also performed commendably compared with the other ML models. Additionally, water content, curing days, fly ash, and w/c ratio were found to be the most critical components that directly impact the CS of fly ash and RCA-based SCC. To cater to diverse and extensive practices, a graphical user interface has been developed to assist researchers and engineers in getting instant results for their fly ash and RCA-based SCC mixes prior to the execution of time- and resource-consuming laboratory work.
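For readers unfamiliar with how k-fold cross-validation yields an R-squared figure for a regression model, the following is a minimal sketch in plain Python. It is not the authors' code: the single-feature least-squares line, the synthetic data, and the choice of five folds are assumptions made purely for illustration. Each fold is held out once, the model is fitted on the remaining folds, and R² = 1 − SS_res / SS_tot is computed on the held-out points.

```python
import random
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x with a single feature."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    my = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - my) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def k_fold_r2(xs, ys, k=5, seed=0):
    """Return the held-out R^2 of each of the k folds."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    scores = []
    for fold in folds:
        held = set(fold)
        train = [j for j in idx if j not in held]
        # Fit on the k-1 training folds only.
        a, b = fit_line([xs[j] for j in train], [ys[j] for j in train])
        preds = [a + b * xs[j] for j in fold]
        scores.append(r_squared([ys[j] for j in fold], preds))
    return scores


# Synthetic data (assumption): a noisy straight line, not concrete mixes.
rng = random.Random(7)
xs = [rng.uniform(0.0, 10.0) for _ in range(50)]
ys = [3.0 * x + 5.0 + rng.gauss(0.0, 1.0) for x in xs]

scores = k_fold_r2(xs, ys)
print("per-fold R^2:", scores)
print("mean R^2:", mean(scores))
```

Reporting the spread of the per-fold scores alongside their mean gives a sense of how stable the model is across different train/test splits.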


Articles published on K-fold Cross-validation Approach

COPDVD: Automated classification of chronic obstructive pulmonary disease on a new collected and evaluated voice dataset

Optimizing Glioblastoma, IDH-wildtype Treatment Outcomes: A Radiomics and Support Vector Machine-Based Approach to Overall Survival Estimation.

A comparative study of acoustic and ultrasonic nondestructive testing for evaluating melon quality

Structural Condition Assessment of Steel Anchorage Using Convolutional Neural Networks and Admittance Response

A novel data-driven machine learning techniques to predict compressive strength of fly ash and recycled coarse aggregates based self-compacting concrete

Mathematical Modeling Using ANN Based on k-fold Cross Validation Approach and MOAHA Multi-Objective Optimization Algorithm During Turning of Polyoxymethylene POM-C

Effect of social capital, social support and social network formation on the quality of life of American adults during COVID-19

RAGN-L: A stacked ensemble learning technique for classification of Fire-Resistant columns

Hyperspectral imaging benchmark based on machine learning for intraoperative brain tumour detection

Automated tuning of denoising algorithms for noise removal in chromatograms

Strength evaluation sustainable concrete with waste ingredients at elevated temperature by employing interpretable algorithms: Optimization and hyper tuning

A Novel Methodology for Human Kinematics Motion Detection Based on Smartphones Sensor Data Using Artificial Intelligence

A Comparison of Modeling Methods for Predicting Forest Attributes Using Lidar Metrics

A Highly Accurate Dysphonia Detection System Using Linear Discriminant Analysis

Cryptojacking Malware Detection in Docker Images Using Supervised Machine Learning

Multi-label classification of Alzheimer's disease stages from resting-state fMRI-based correlation connectivity data and deep learning

Application of Soft-Computing Methods to Evaluate the Compressive Strength of Self-Compacting Concrete.

Application of data-mining technique and hydro-chemical data for evaluating vulnerability of groundwater in Indo-Gangetic Plain.

Quantitative chest computed tomography combined with plasma cytokines predict outcomes in COVID-19 patients

Predicting the Rheological Properties of Super-Plasticized Concrete Using Modeling Techniques.
