Predict Performance Metrics Research Articles

Abstract Pellet quality is a crucial key performance indicator (KPI) for commercial feed manufacturing, which influences both the efficiency of the feed mill and downstream performance of animals fed these diets. However, due to the complexity of feed manufacturing and the large number of factors involved in the manufacturing process, controlling pellet quality is an ongoing challenge for the feed industry. Previous studies have mainly explored the impact of a few factors on pellet quality under experimental settings, and empirical equations have been seldomly developed to reflect the relationship between the factors and pellet quality under the commercial feed mill settings. This study aimed to establish a relationship between pellet quality and factors collected under the settings of a commercial feed mill. The data were collected from Trouw Nutrition Canada’s feed mill located in St. Marys, Ontario (Plant 2), between December 15, 2021, and December 6, 2022. During this period, 2,691 observations were collected, with each observation representing an individual batch of pelleted feed. A total of 75 factors were recorded, including 4 factors associated with the general information of each batch, 10 manufacturing parameters, 41 feed ingredients, 8 factors regarding the nutrient composition of each diet, and 12 environmental factors. Pellet Durability Index (PDI), which was the response variable, was determined for each batch using the Holmen method. The data were randomly split into an 80% subset for training and a 20% subset for testing. The training subset was used to construct the model via a 5-fold cross-validation, while the testing subset was withheld as an independent dataset to evaluate the generalization performance of the model. The response variable (PDI) was transformed (tPDI) using the Box-Cox method to meet a normal distribution assumption. To avoid multicollinearity, Principal Component Analysis (PCA) was used to reduce the dimensionality of the numeric factors before building the multiple linear regression model. The model prediction performance was evaluated on both the training subset (using 5-fold cross-validation) and the testing subset, and the prediction performance metrics were consistent between the two subsets (Mean Absolute Error = 1.94 ± 0.102 vs. 2.02; Root Mean Square Prediction Error = 2.47 ± 0.111 vs. 2.58; Mean Square Prediction Error = 6.12 ± 0.538 vs. 6.68; Concordance Correlation Coefficient = 0.538 ± 0.0231 vs. 0.490; Pearson Correlation Coefficient = 0.606 ± 0.0247 vs. 0.553, respectively). Most feed ingredients and nutrient compositions showed either positive or negative loadings on Component 1 (17.87% of total variance), and outdoor/indoor environmental factors were positively loaded on Component 2 (14.21% of total variance). The model developed in this study could help commercial feed mills better understand how various factors impact pellet quality and optimize the manufacturing processes of pelleted feeds.

Read full abstract

IntroductionColorectal cancer (CRC), also known as colorectal cancer, is a significant disease marked by high fatality rates, ranking as the third leading cause of global mortality. The main objective of this study was to assess the accuracy of predictive models in predicting both mortality events and the probability of disease recurrence. MethodA retrospective analysis was conducted on a cohort of 284 individuals diagnosed with colorectal cancer between 2001 and 2017. Demographic and clinical data, including gender, disease stage, age at diagnosis, recurrence status, and treatment details, were meticulously recorded. We rigorously evaluated various predictive models, including Decision Trees, Random Forests, Random Survival Forests (RSF), Gradient Boosting, mboost, Deep Learning Neural Network (DLNN), and Cox regression. Performance metrics, such as sensitivity, positive predictive value (PPV), specificity, area under the receiver operating characteristic curve (ROC area), and overall accuracy, were calculated for each model to predict mortality and disease recurrence. The analysis was performed using R version 4.1.3 software and the Python programming language. ResultsFor mortality prediction, the mboost model demonstrated the highest sensitivity at 96.9% (95% CI: 0.83–0.99) and an ROC area of 0.88. It also exhibited high specificity at 80% (95% CI: 0.59–0.93), a positive predictive value of 86.1% (95% CI: 0.70–0.95), and an overall accuracy of 89% (95% CI: 0.78–0.96). Random Forests showed perfect sensitivity of 100% (95% CI: 0.85–1) but had low specificity at 0% (95% CI: 0–0.52) and poor overall accuracy (50%). On the other hand, DLNN had the lowest performance metrics for mortality prediction, with a sensitivity of 24% (95% CI: 0.222–0.268), specificity of 75% (95% CI: 0.73–0.77), and a lower positive predictive value of 42% (95% CI: 0.38–0.45). The Gradient Boosting model showed the best performance in predicting recurrence, achieving perfect sensitivity of 100% (95% CI: 0.87–1) and high specificity at 92.9% (95% CI: 0.76–0.99). It also had a high positive predictive value of 93.3% (95% CI: 0.77–0.99). Gradient Boosting, with an ROC area of 96.4%, and mboost, with an ROC area of 75%, demonstrated remarkable performance. DLNN had the lowest performance metrics for recurrence prediction, with sensitivity at 1.75% (95% CI: 0.01–0.02), specificity at 98% (95% CI: 0.97–0.98), and a lower positive predictive value at 52.6% (95% CI: 0.39–0.65). ConclusionIn summary, the mboost model demonstrated outstanding performance in predicting mortality, achieving exceptional results across various evaluation metrics. Random Forests exhibited perfect sensitivity but showed poor specificity and overall accuracy. The DLNN model displayed the lowest performance metrics for mortality prediction. In terms of recurrence prediction, the Gradient Boosting model outperformed other models with perfect sensitivity, high specificity, and positive predictive value. The DLNN model had the lowest performance metrics for recurrence prediction. Overall, the results emphasize the effectiveness of the mboost and Gradient Boosting models in predicting mortality and recurrence in colorectal cancer patients.

Read full abstract

Predict Performance Metrics Research Articles

Related Topics

Articles published on Predict Performance Metrics

Limits of a single surrogate model development methodology to represent housing stocks

AI-Powered Post-Discharge Monitoring to Prevent Patients Readmissions and Reduce Workforce Burden

PRO-SMOTEBoost: An adaptive SMOTEBoost probabilistic algorithm for rebalancing and improving imbalanced data classification

493 Predicting pellet quality using multiple linear regression with Principal Component Analysis (PCA)

Ensemble of naive Bayes, decision tree, and random forest to predict air quality

(Invited) Atomic-Scale Modeling of Charge-Transfer Kinetics at LixCoO2 Cathode-Electrolyte Interfaces

Kinetics prediction of normal knee and undergone total knee arthroplasty during squatting based on extreme gradient boosting

Evaluating electrical power yield of photovoltaic solar cells with k-Nearest neighbors: A machine learning statistical analysis approach

Detecting defects that reduce breakdown voltage using machine learning and optical profilometry

Predicting mortality and recurrence in colorectal cancer: Comparative assessment of predictive models

Integration of risk factor polygenic risk score with disease polygenic risk score for disease prediction

Expert-augmented machine learning to accelerate the discovery of copolymers for anion exchange membrane

Modeling influence of weather variables on energy consumption in an agricultural research institute in Ibadan, Nigeria

Humidification potential optimization of various membranes for proton exchange membrane fuel cell: Experiments and deep learning assisted metaheuristics

Reliable AI models can reveal key processes of heat recovery steam generator operation in air pollutant emission

A study of machine learning to predict NRDS severity based on lung ultrasound score and clinical indicators.

Computer vision model for the detection of canine pododermatitis and neoplasia of the paw.

Development and validation of age-specific risk prediction models for primary ovarian insufficiency in long-term survivors of childhood cancer: a report from the Childhood Cancer Survivor Study and St Jude Lifetime Cohort

Development and Validation of a Model to Quantify Injury Severity in Real Time

Comparison of known spawner abundance from fence counts to visual counts for simplified spawner estimation methods

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Predict Performance Metrics Research Articles

Related Topics

Articles published on Predict Performance Metrics

Limits of a single surrogate model development methodology to represent housing stocks

AI-Powered Post-Discharge Monitoring to Prevent Patients Readmissions and Reduce Workforce Burden

PRO-SMOTEBoost: An adaptive SMOTEBoost probabilistic algorithm for rebalancing and improving imbalanced data classification

493 Predicting pellet quality using multiple linear regression with Principal Component Analysis (PCA)

Ensemble of naive Bayes, decision tree, and random forest to predict air quality

(Invited) Atomic-Scale Modeling of Charge-Transfer Kinetics at LixCoO2 Cathode-Electrolyte Interfaces

Kinetics prediction of normal knee and undergone total knee arthroplasty during squatting based on extreme gradient boosting

Evaluating electrical power yield of photovoltaic solar cells with k-Nearest neighbors: A machine learning statistical analysis approach

Detecting defects that reduce breakdown voltage using machine learning and optical profilometry

Predicting mortality and recurrence in colorectal cancer: Comparative assessment of predictive models

Integration of risk factor polygenic risk score with disease polygenic risk score for disease prediction

Expert-augmented machine learning to accelerate the discovery of copolymers for anion exchange membrane

Modeling influence of weather variables on energy consumption in an agricultural research institute in Ibadan, Nigeria

Humidification potential optimization of various membranes for proton exchange membrane fuel cell: Experiments and deep learning assisted metaheuristics

Reliable AI models can reveal key processes of heat recovery steam generator operation in air pollutant emission

A study of machine learning to predict NRDS severity based on lung ultrasound score and clinical indicators.

Computer vision model for the detection of canine pododermatitis and neoplasia of the paw.

Development and validation of age-specific risk prediction models for primary ovarian insufficiency in long-term survivors of childhood cancer: a report from the Childhood Cancer Survivor Study and St Jude Lifetime Cohort

Development and Validation of a Model to Quantify Injury Severity in Real Time

Comparison of known spawner abundance from fence counts to visual counts for simplified spawner estimation methods