High R2 Values Research Articles

This paper proposes the use of interpretable ensemble learning models for predicting the mechanical properties (bulk, shear, and Young's moduli) of ABX3 perovskite compounds, where A, B, and X refer to the three elements that make up the cubic three-dimensional framework of the perovskite. Three ensemble learning techniques are considered: CatBoost, Random Forest, and XGBoost. To expand the feature space, first-principles density functional theory calculations were used to generate some of the input features, namely elastic constants, density, volume per atom, and ground state energy per atom. We then determined the ranking of the input features that influence the machine learning (ML) model decisions. For this, we performed correlation analysis on the multi-dimensional input feature space, suppressed features with high collinearity, and selected features with limited correlation. We trained the three ensemble learning techniques on the resulting input feature vectors to predict the mechanical properties. Furthermore, we employed the Shapley Additive Explanations (SHAP) algorithm to analyse the decision-making rationale of the ensemble learning models. We measured performance using error metrics and the coefficient of determination, R2. The results show that XGBoost outperforms the other approaches when predicting the shear modulus or Young's modulus of the perovskite compounds, yielding the lowest error metrics and the highest R2 value (0.97) in the testing phase. However, both CatBoost and Random Forest outperformed XGBoost when predicting the bulk modulus in the testing phase. The weaker performance of XGBoost on the bulk modulus can be ascribed to overfitting, which occurs when an ML model gives accurate predictions for training data but not for test data.
Furthermore, the SHAP algorithm provides insight into the order of feature importance (from highest to lowest). Additionally, we conducted a post-analysis using a holistic ranking to compare the relative importance of the SHAP feature impacts across the examined ensemble learning techniques. Our findings indicate that the elastic constants are the most important input features influencing the predictive decisions of the ensemble learning models.
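
The collinearity step described in the abstract (correlation analysis, then suppressing highly correlated features) can be illustrated with a minimal sketch. This is not the authors' procedure: the greedy keep-or-drop rule, the 0.9 threshold, the feature names, and the numbers are all assumptions chosen only to show the idea.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences.

    Assumes neither sequence is constant (a zero variance would divide by zero).
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def drop_collinear(features, threshold=0.9):
    """Greedily keep a feature only if |r| <= threshold against every kept feature.

    The threshold of 0.9 is an illustrative assumption, not a value from the paper.
    """
    kept = []
    for name in features:
        if all(abs(pearson(features[name], features[k])) <= threshold for k in kept):
            kept.append(name)
    return kept

# Synthetic, hypothetical values; the names only echo the kinds of features
# listed in the abstract.
features = {
    "c11": [1.0, 2.0, 3.0, 4.0, 5.0],
    "c11_scaled": [2.1, 4.0, 6.2, 7.9, 10.0],  # nearly proportional to c11
    "density": [3.0, 1.0, 4.0, 1.0, 5.0],
}

selected = drop_collinear(features)
print(selected)
```

Here `c11_scaled` is dropped because it is almost perfectly correlated with `c11`, while `density` is kept. The result of a greedy filter depends on the order in which features are visited, so a real pipeline would need an explicit rule for which member of a correlated pair to retain.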

Abstract: Eugenia uniflora is a tropical species rich in bioactive compounds that are highly sensitive to processing conditions, particularly those involving heat. Drying is a widely used method for preserving fruit pulp, but it can affect the stability of these bioactive compounds. The aim of this study was to assess the impact of drying at different temperatures on the retention of phenolic compounds and carotenoids present in the pulp of E. uniflora, with the goal of optimizing the drying process to preserve the pulp's quality. The E. uniflora pulp was dried in an oven with air circulation at three temperatures (45, 65, and 85°C). During the drying process, the moisture content and the concentrations of phenolics, β‐carotene, and lycopene were measured over time. After 260 min of drying, phenolic compounds decreased by 63.97% (45 and 65°C) and by 59.62% (85°C). Carotenoid losses were even more pronounced, exceeding 89% at all temperatures, with β‐carotene reductions of 92.91%, 90.72%, and 91.11% at 45, 65, and 85°C, respectively. Several well‐established drying models were tested to represent the moisture content over time, and two of them fit the experimental data closely. Zero‐order, first‐order, and second‐order degradation models were used to describe the concentrations of phenolic compounds and carotenoids. For total carotenoids, the model that showed the best results was temperature‐dependent. The first‐order model provided the best fit for β‐carotene and lycopene, with high R2 values of 0.9852 (45°C), 0.9776 (65°C), and 0.9681 (85°C) for β‐carotene, and 0.9776 (45°C), 0.9715 (65°C), and 0.9659 (85°C) for lycopene. These results indicate that higher temperatures accelerate the degradation of bioactive compounds, following a predictable dynamic that can be optimized through adjustments to the drying process.

Practical applications: The fruit of Eugenia uniflora contains phenolic and carotenoid compounds with significant nutraceutical potential. However, the production of this fruit is seasonal, necessitating effective preservation methods. Drying, which involves the removal of water from the pulp, is a common procedure aimed at extending the fruit's conservation and shelf life. Despite its benefits, the drying process poses a challenge, as bioactive compounds such as phenolics and carotenoids are sensitive to thermal processing. Their degradation during drying can lead to a reduction in the fruit's bioactive potential. Therefore, it is crucial to understand the kinetics of drying and the degradation of these bioactive compounds to optimize the drying process and maximize the fruit's nutraceutical value.
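
As a rough illustration of the first‐order degradation model named in the abstract, the sketch below fits C(t) = C0·exp(−k·t) by ordinary least squares on ln C versus t and then computes R2 on the original concentration scale. The fitting route, the sampling times (apart from the 260 min end point), and the concentrations are assumptions; the data are synthetic and are not the study's measurements.

```python
import math

def fit_first_order(times, conc):
    """Fit C(t) = C0 * exp(-k * t) via a straight-line fit of ln(C) against t.

    Returns (C0, k). Assumes all concentrations are strictly positive.
    """
    y = [math.log(c) for c in conc]
    n = len(times)
    mt, my = sum(times) / n, sum(y) / n
    slope = (sum((t - mt) * (v - my) for t, v in zip(times, y))
             / sum((t - mt) ** 2 for t in times))
    intercept = my - slope * mt
    return math.exp(intercept), -slope

def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean) ** 2 for o in observed)
    return 1 - ss_res / ss_tot

# Synthetic data generated from an exact first-order decay with a
# hypothetical rate constant k = 0.01 per minute and C0 = 100 (arbitrary units).
times = [0, 60, 120, 180, 260]  # minutes
conc = [100.0 * math.exp(-0.01 * t) for t in times]

c0, k = fit_first_order(times, conc)
predicted = [c0 * math.exp(-k * t) for t in times]
r2 = r_squared(conc, predicted)
print(f"C0 = {c0:.3f}, k = {k:.5f} per min, R2 = {r2:.6f}")
```

Because the synthetic data follow the model exactly, the fit recovers the chosen parameters and R2 is essentially 1; real measurements would scatter around the curve and give lower values, such as those reported above. Fitting on the log scale weights low concentrations more heavily than a direct nonlinear fit would, which is one reason the two approaches can give slightly different rate constants.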

Articles published on High R2 Values

Interpretable machine learning methods to predict the mechanical properties of ABX3 perovskites

Enhancing Solar Power Efficiency: Smart Metering and ANN-Based Production Forecasting

Efficacy Assessment of Molecularly Imprinted Polymer Incorporated with Starch and Macadamia Capped Silver Nanoparticles for the Removal of Multi‐Class Pharmaceuticals in Wastewater

Prediction of fresh herbage yield using data mining techniques with limited plant quality parameters

Optimization and prediction of biogas yield from pretreated Ulva Intestinalis Linnaeus applying statistical-based regression approach and machine learning algorithms

Trait imputation enhances nonlinear genetic prediction for some traits.

Comparative study of five machine learning algorithms on prediction of the height of the water-conducting fractured zone in undersea mining

Multi-objective optimization of an EDM process for Monel K-500 alloy using response surface methodology-multi-objective dragonfly algorithm

From organic waste to renewable energy: response surface methodology approach for optimized biodiesel production from palm weevil larvae (Rhynchophorus ferrugineus)

Combining ACE, PLS-R, and SVM-R for rapid detection of adulteration in saffron samples by diffuse reflectance infrared fourier transform spectroscopy

Enhancing Stock Price Prediction in the Indonesian Market: a Concave LSTM Approach with RunReLU

Effect of Ultrasound and Osmotic Dehydration as Pretreatments on the Infrared Drying of Banana Slices.

Optimizing nitrite content in powders from plasma-activated egg whites for meat preservation using response surface methodology

Valorization of coffee agro-industrial residue for biochar production: Use as adsorbent for methylene blue removal

The influence of data aggregation levels on accident severity researches

Effect of drying on bioactive compounds in Eugenia uniflora fruit pulp

The Effect of Project Management System Implementation, BIM Technology, and Cloud Collaboration on Construction Project Efficiency in Riau

Grey-box solution for predicting thermo-mechanical response of rocks

REMOVAL OF VARIOUS METAL IONS IN WATER BY DIFFERENT PRE-TREATMENTS OF FLY ASH

Parametric modeling of resin-bonded sand mold systems using machine learning-based approaches
