Class Of Statistical Models Research Articles

Introduction. The study addresses the challenge of using human gut microbiome data for the early detection of colorectal cancer (CRC). It emphasizes the potential of machine learning techniques to analyse complex microbiome datasets, providing a non-invasive approach to identifying CRC-related microbial markers.

Hypothesis/Gap Statement. The primary hypothesis is that a robust machine learning-based analysis of 16S rRNA microbiome data can identify specific microbial features that serve as effective biomarkers for CRC detection, overcoming the limitations of classical statistical models in high-dimensional settings.

Aim. The primary objective of this study is to explore and validate the potential of the human microbiome, specifically in the colon, as a valuable source of biomarkers for CRC detection and progression. The focus is on developing a classifier that effectively distinguishes CRC samples from normal samples, based on the analysis of three previously published faecal 16S rRNA sequencing datasets.

Methodology. To achieve the aim, various machine learning techniques are employed, including random forest (RF), recursive feature elimination (RFE) and a robust correlation-based technique known as the fuzzy forest (FF). The study applies these methods to the three datasets and compares their performance in predicting CRC and normal samples. The emphasis is on identifying the most relevant microbial features (taxa) associated with CRC development via partial dependence plots, an explainability tool from machine learning that visualizes how a feature influences the predicted outcome.

Results. The analysis of the three faecal 16S rRNA sequencing datasets reveals the consistent and superior predictive performance of the FF compared to the RF and RFE. Notably, FF proves effective in addressing the correlation problem when assessing the importance of microbial taxa in explaining the development of CRC. The results highlight the potential of the human microbiome as a non-invasive means to detect CRC and underscore the significance of employing FF for improved predictive accuracy.

Conclusion. This study underscores the limitations of classical statistical techniques in handling high-dimensional information such as human microbiome data. The research demonstrates the potential of the human microbiome, specifically in the colon, as a valuable source of biomarkers for CRC detection. Applying machine learning techniques, particularly the FF, is a promising approach for building a classifier to predict CRC and normal samples. The findings advocate for integrating FF to overcome the challenges associated with correlation when identifying crucial microbial features linked to CRC development.
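The partial dependence idea mentioned in the methodology can be illustrated with a minimal sketch. This is not the study's code: the function `partial_dependence`, the stand-in `toy_model`, and the two-feature `samples` are all hypothetical, chosen only to show how the curve is computed. For each grid value, the feature of interest is forced to that value in every sample and the model's predictions are averaged.

```python
from statistics import mean
from typing import Callable, List, Sequence

Row = List[float]


def partial_dependence(
    predict: Callable[[Row], float],
    rows: Sequence[Row],
    feature: int,
    grid: Sequence[float],
) -> List[float]:
    """Average prediction when `feature` is forced to each grid value."""
    curve = []
    for value in grid:
        preds = []
        for row in rows:
            modified = list(row)       # copy so the input data is untouched
            modified[feature] = value  # fix the feature of interest
            preds.append(predict(modified))
        curve.append(mean(preds))
    return curve


def toy_model(row: Row) -> float:
    """Hypothetical stand-in for a fitted classifier's predicted probability."""
    return min(1.0, 0.2 + 0.5 * row[0] + 0.1 * row[1])


# Hypothetical relative abundances of two taxa in three samples.
samples = [[0.1, 0.3], [0.4, 0.2], [0.0, 0.9]]
pd_curve = partial_dependence(toy_model, samples, feature=0, grid=[0.0, 0.5, 1.0])
print(pd_curve)
```

In practice the `predict` argument would wrap a fitted model such as a random forest; the averaging step stays the same.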

Objectives. To predict and identify the key demographic and clinical exposure factors associated with dental anxiety among young adults, and to assess whether the traditional statistical modelling approach gives results similar to the machine learning (ML) approach in predicting factors for dental anxiety.

Methods. A cross-sectional study of Western Illinois University students. Three survey instruments (a sociodemographic questionnaire, the modified dental anxiety scale (MDAS), and the dental concerns assessment tool (DCA)) were distributed to the students by email using SurveyMonkey. The dependent variable was the mean MDAS score, while the independent variables were the sociodemographic and dental concerns assessment variables. Multivariable analysis was done by comparing the classical statistical model and the machine learning model. The classical statistical modelling used multiple linear regression, with the final model selected by the Akaike Information Criterion (AIC) using backward stepwise selection, while the machine learning modelling compared two ML models, LASSO regression and extreme gradient boosting (XGBoost), under 5-fold cross-validation using the resampling technique. All statistical analyses were performed using R version 4.1.3.

Results. The mean MDAS was 13.73 ± 5.51. After careful consideration of all possible fitted models and their interaction terms, the classical statistical approach yielded a parsimonious model with 13 predictor variables and an AIC of 2376.4. For the ML approach, the LASSO regression model was the best-performing model, with a mean RMSE of 0.617, R² of 0.615, and MAE of 0.483. Comparing the variable selection of the ML and classical statistical models, both model types identified 12 similar variables (out of 13) as the most important predictors of dental anxiety in this study population.

Conclusion. There is a high burden of dental anxiety within this study population. This study contributes to reducing the knowledge gap about the impact of clinical exposure variables on dental anxiety and the role of machine learning in the prediction of dental anxiety. The predictor variables identified can be used to inform public health interventions, and interventions geared towards eliminating the individual clinical exposure triggers of dental anxiety are recommended.
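The evaluation scheme described in the methods (k-fold cross-validation scored by RMSE, MAE and R²) can be sketched as follows. This is an illustration under stated assumptions, not the study's analysis: the study used R with LASSO and XGBoost, whereas this sketch swaps in a one-predictor ordinary least squares fit so that it stays dependency-free, and the data, `fit_line`, `metrics` and `k_fold_cv` are all hypothetical.

```python
from math import sqrt
from statistics import mean
from typing import List, Sequence, Tuple


def fit_line(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least squares for one predictor; returns (slope, intercept)."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    return slope, my - slope * mx


def metrics(y_true: Sequence[float], y_pred: Sequence[float]) -> Tuple[float, float, float]:
    """Return (RMSE, MAE, R^2) for one set of predictions."""
    errors = [t - p for t, p in zip(y_true, y_pred)]
    rmse = sqrt(mean([e * e for e in errors]))
    mae = mean([abs(e) for e in errors])
    my = mean(y_true)
    ss_tot = sum((t - my) ** 2 for t in y_true)
    r2 = 1 - sum(e * e for e in errors) / ss_tot
    return rmse, mae, r2


def k_fold_cv(x: List[float], y: List[float], k: int = 5) -> Tuple[float, float, float]:
    """Mean (RMSE, MAE, R^2) over k folds; fold membership is index modulo k."""
    fold_scores = []
    for fold in range(k):
        test_idx = [i for i in range(len(x)) if i % k == fold]
        train_idx = [i for i in range(len(x)) if i % k != fold]
        slope, intercept = fit_line([x[i] for i in train_idx], [y[i] for i in train_idx])
        preds = [slope * x[i] + intercept for i in test_idx]
        fold_scores.append(metrics([y[i] for i in test_idx], preds))
    return tuple(mean([s[j] for s in fold_scores]) for j in range(3))


# Hypothetical data: a linear trend with small deterministic noise.
noise = [0.1, -0.1, 0.05, -0.05, 0.0, 0.1, -0.1, 0.05, -0.05, 0.0]
x_vals = [float(i) for i in range(10)]
y_vals = [2 * xi + 1 + n for xi, n in zip(x_vals, noise)]
rmse, mae, r2 = k_fold_cv(x_vals, y_vals, k=5)
print(rmse, mae, r2)
```

Averaging the per-fold scores mirrors the "mean RMSE" wording in the results; a real analysis would typically shuffle the rows before assigning folds.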

Articles published on Class Of Statistical Models

Machine Learning-Based Prediction for Incident Hypertension Based on Regular Health Checkup Data: Derivation and Validation in 2 Independent Nationwide Cohorts in South Korea and Japan.

Robust prediction of colorectal cancer via gut microbiome 16S rRNA sequencing data.

Artificial intelligence in prostate cancer: The potential of machine learning models and neural networks to predict biochemical recurrence after robot-assisted radical prostatectomy

Development of Econometric Models for Financial Performance Forecasting in Companies

Predicting Economic Trends and Stock Market Prices with Deep Learning and Advanced Machine Learning Techniques

A comparative analysis of classical and machine learning methods for forecasting TB/HIV co-infection

Efficient First-Order Algorithms for Large-Scale, Non-Smooth Maximum Entropy Models with Application to Wildfire Science.

A censored quantile transformation model for Alzheimer’s Disease data with multiple functional covariates

Optimization of the Use of Cloud Computing Resources Using Exploratory Data Analysis and Machine Learning

Fusion surface models: 2+1d lattice models from fusion 2-categories

Simulation of functional additive and non-additive genetic effects using statistical estimates from quantitative genetic models

An Enhanced Long-Term Wind Speed Prediction Using Dynamic Unified Ensemble Learning and Data Assimilation Techniques: A Case Study in Tamil Nadu, India

Decomposition and Holt-Winters Enhanced by the Whale Optimization Algorithm for Forecasting the Amount of Water Inflow into the Large Dam Reservoirs in Southern Thailand

Identification of causal influences in quantum processes

Code-Free Machine Learning Approach for EVO-ICL Vault Prediction: A Retrospective Two-Center Study.

Geospatial assessment of landslide-prone areas in the southern part of Anambra State, Nigeria using classical statistical models

Predicting dental anxiety in young adults: classical statistical modelling approach versus machine learning approach

Identifying predictors of the tooth loss phenotype in a large periodontitis patient cohort using a machine learning approach

Limitations of Linear Cross-Entropy as a Measure for Quantum Advantage

Sequential estimation for mixture of regression models for heterogeneous population
