Deep Forest Research Articles

Background: The use of machine learning models in sequence-based Protein-Protein Interaction prediction typically requires the conversion of amino acid sequences into feature vectors. From the literature, two approaches have been used to achieve this transformation. These are referred to as the Independent Protein Feature (IPF) and Merged Protein Feature (MPF) extraction methods. As observed, studies have predominantly adopted the IPF approach, while others preferred the MPF method, in which host and pathogen sequences are concatenated before feature encoding. Objective: This presents the challenge of determining which approach should be adopted for improved HPPPI prediction. Therefore, this work introduces the Extended Protein Feature (EPF) method. Methods: The proposed method combines the predictive capabilities of IPF and MPF, extracting essential features, handling multicollinearity, and removing features with zero importance. EPF, IPF, and MPF were tested using bacteria, parasite, virus, and plant HPPPI datasets and were deployed to machine learning models, including Random Forest (RF), Support Vector Machine (SVM), Multilayer Perceptron (MLP), Naïve Bayes (NB), Logistic Regression (LR), and Deep Forest (DF). Results: The results indicated that MPF exhibited the lowest performance overall, whereas IPF performed better with decision tree-based models, such as RF and DF. In contrast, EPF demonstrated improved performance with SVM, LR, NB, and MLP and also yielded competitive results with DF and RF. Conclusion: In conclusion, the EPF approach developed in this study exhibits substantial improvements in four out of the six models evaluated. This suggests that EPF offers competitiveness with IPF and is particularly well-suited for traditional machine learning models.

This study aims to construct a predictive model based on machine learning algorithms to assess the risk of prolonged hospital stays post-surgery for colorectal cancer patients and to analyze preoperative and postoperative factors associated with extended hospitalization. We prospectively collected clinical data from 83 colorectal cancer patients. The study included 40 variables (comprising 39 predictor variables and 1 target variable). Important variables were identified through variable selection via the Lasso regression algorithm, and predictive models were constructed using ten machine learning models, including Logistic Regression, Decision Tree, Random Forest, Support Vector Machine, Light Gradient Boosting Machine, KNN, and Extreme Gradient Boosting, Categorical Boosting, Artificial Neural Network and Deep Forest. The model performance was evaluated using Bootstrap ROC curves and calibration curves, with the optimal model selected and further interpreted using the SHAP explainability algorithm. Ten significantly correlated important variables were identified through Lasso regression, validated by 1000 Bootstrap resamplings, and represented through Bootstrap ROC curves. The Logistic Regression model achieved the highest AUC (AUC=0.99, 95% CI=0.97-0.99). The explainable machine learning algorithm revealed that the distance walked on the third day post-surgery was the most important variable for the LR model. This study successfully constructed a model predicting postoperative hospital stay duration using patients' clinical data. This model promises to provide healthcare professionals with a more precise prediction tool in clinical practice, offering a basis for personalized nursing interventions, thereby improving patient prognosis and quality of life and enhancing the efficiency of medical resource utilization.

Deep Forest Research Articles

Related Topics

Articles published on Deep Forest

DPI_CDF: druggable protein identifier using cascade deep forest

HIERARCHICAL CLUSTERIZATION AND DEEP LEARNING MODEL RANDOM FOREST OF BANKS’ STABILITY UNDER RISK CONDITIONS

Forest Cows Secrets: Cracking the Code With Movement Sensors

Fracture identification of carbonate reservoirs by deep forest model: An example from the D oilfield in Zagros Basin

An Extended Feature Representation Technique for Predicting Sequenced-based Host-pathogen Protein-protein Interaction

Multi-view uncertainty deep forest: An innovative deep forest equipped with uncertainty estimation for drug-induced liver injury prediction

Screening androgen receptor agonists of fish species using machine learning and molecular model in NORMAN water-relevant list

An effective deep learning scheme for android malware detection leveraging performance metrics and computational resources

High-spatial-resolution surface soil moisture retrieval using the Deep Forest model in the cloud environment over the Tibetan Plateau

Location-Aware Deep Interaction Forest for Web Service QoS Prediction

TSCF: An Improved Deep Forest Model for Time Series Classification

Qualitatively and quantitatively explore injury severity of light motor vehicle drivers involved in heavy goods vehicle crashes

Interpreting Deep Forest through Feature Contribution and MDI Feature Importance

Application research of credit fraud detection based on distributed rotation deep forest

Identifying Antitubercular Peptides via Deep Forest Architecture with Effective Feature Representation.

APDF: An active preference-based deep forest expert system for overall survival prediction in gastric cancer

Construction of a predictive model for postoperative hospitalization time in colorectal cancer patients based on interpretable machine learning algorithm: a prospective preliminary study.

A Study on Analyzing Symbolism in Literature by Combining Computer Vision Techniques

TDFFM: Transformer and Deep Forest Fusion Model for Predicting Coronavirus 3C-Like Protease Cleavage Sites.

A Distance Transformation Deep Forest Framework With Hybrid-Feature Fusion for CXR Image Classification.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Deep Forest Research Articles

Related Topics

Articles published on Deep Forest

DPI_CDF: druggable protein identifier using cascade deep forest

HIERARCHICAL CLUSTERIZATION AND DEEP LEARNING MODEL RANDOM FOREST OF BANKS’ STABILITY UNDER RISK CONDITIONS

Forest Cows Secrets: Cracking the Code With Movement Sensors

Fracture identification of carbonate reservoirs by deep forest model: An example from the D oilfield in Zagros Basin

An Extended Feature Representation Technique for Predicting Sequenced-based Host-pathogen Protein-protein Interaction

Multi-view uncertainty deep forest: An innovative deep forest equipped with uncertainty estimation for drug-induced liver injury prediction

Screening androgen receptor agonists of fish species using machine learning and molecular model in NORMAN water-relevant list

An effective deep learning scheme for android malware detection leveraging performance metrics and computational resources

High-spatial-resolution surface soil moisture retrieval using the Deep Forest model in the cloud environment over the Tibetan Plateau

Location-Aware Deep Interaction Forest for Web Service QoS Prediction

TSCF: An Improved Deep Forest Model for Time Series Classification

Qualitatively and quantitatively explore injury severity of light motor vehicle drivers involved in heavy goods vehicle crashes

Interpreting Deep Forest through Feature Contribution and MDI Feature Importance

Application research of credit fraud detection based on distributed rotation deep forest

Identifying Antitubercular Peptides via Deep Forest Architecture with Effective Feature Representation.

APDF: An active preference-based deep forest expert system for overall survival prediction in gastric cancer

Construction of a predictive model for postoperative hospitalization time in colorectal cancer patients based on interpretable machine learning algorithm: a prospective preliminary study.

A Study on Analyzing Symbolism in Literature by Combining Computer Vision Techniques

TDFFM: Transformer and Deep Forest Fusion Model for Predicting Coronavirus 3C-Like Protease Cleavage Sites.

A Distance Transformation Deep Forest Framework With Hybrid-Feature Fusion for CXR Image Classification.