Random Forest Importance Research Articles

ObjectiveTo analyze the epidemiological history, clinical symptoms, laboratory testing parameters of patients with mild and severe COVID-19 infection, and provide a reference for timely judgment of changes in the patients’ conditions and the formulation of epidemic prevention and control strategies.MethodsA retrospective study was conducted in this research, a total of 90 patients with COVID-19 infection who received treatment from January 21 to March 31, 2020 in the Ninth People’s Hospital of Dongguan City were selected as study subject. We analyzed the clinical characteristics of laboratory-confirmed patients with COVID-19, used the oversampling method (SMOTE) to solve the imbalance of categories, and established Lasso-logistic regression and random forest models.ResultsAmong the 90 confirmed COVID-19 cases, 79 were mild and 11 were severe. The average age of the patients was 36.1 years old, including 49 males and 41 females. The average age of severe patients is significantly older than that of mild patients (53.2 years old vs 33.7 years old). The average time from illness onset to hospital admission was 4.1 days and the average actual hospital stay was 18.7 days, both of these time actors were longer for severe patients than for mild patients. Forty-eight of the 90 patients (53.3%) had family cluster infections, which was similar among mild and severe patients. Comorbidities of underlying diseases were more common in severe patients, including hypertension, diabetes and other diseases. The most common symptom was cough [45 (50%)], followed by fever [43 (47.8%)], headache [7 (7.8%)], vomiting [3 (3.3%)], diarrhea [3 (3.3%)], and dyspnea [1 (1.1%)]. The laboratory findings of patients also included leukopenia [13(14.4%)] and lymphopenia (17.8%). Severe patients had a low level of creatine kinase (median 40.9) and a high level of D-dimer. The median NLR of severe patients was 2.82, which was higher than that of mild patients. Logistic regression showed that age, phosphocreatine kinase, procalcitonin, the lymphocyte count of the patient on admission, cough, fatigue, and pharynx dryness were independent predictors of COVID-19 severity. The classification of random forest was predicted and the importance of each variable was displayed. The variable importance of random forest indicates that age, D-dimer, NLR (neutrophil to lymphocyte ratio) and other top-ranked variables are risk factors.ConclusionThe clinical symptoms of COVID-19 patients are non-specific and complicated. Age and the time from onset to admission are important factors that determine the severity of the patient’s condition. Patients with mild illness should be closely monitored to identify those who may become severe. Variables such as age and creatine phosphate kinase selected by logistic regression can be used as important indicators to assess the disease severity of COVID-19 patients. The importance of variables in the random forest further complements the variable feature information.

Read full abstract

Background: Gulf War Illness (GWI) and Chronic Fatigue Syndrome (CFS) are two debilitating disorders that share similar symptoms of chronic pain, fatigue, and exertional exhaustion after exercise. Many physicians continue to believe that both are psychosomatic disorders and to date no underlying etiology has been discovered. As such, uncovering objective biomarkers is important to lend credibility to criteria for diagnosis and to help differentiate the two disorders. Methods: We assessed cognitive differences in 80 subjects with GWI and 38 with CFS by comparing corresponding fMRI scans during 2-back working memory tasks before and after exercise to model brain activation during normal activity and after exertional exhaustion, respectively. Voxels were grouped by the count of total activity into the Automated Anatomical Labeling (AAL) atlas and used in an “ensemble” series of machine learning algorithms to assess if a multi-regional pattern of differences in the fMRI scans could be detected. Results: A K-Nearest Neighbor (70%/81%), Linear Support Vector Machine (SVM) (70%/77%), Decision Tree (82%/82%), Random Forest (77%/78%), AdaBoost (69%/81%), Naïve Bayes (74%/78%), Quadratic Discriminant Analysis (QDA) (73%/75%), Logistic Regression model (82%/82%), and Neural Net (76%/77%) were able to differentiate CFS from GWI before and after exercise with an average of 75% accuracy in predictions across all models before exercise and 79% after exercise. An iterative feature selection and removal process based on Recursive Feature Elimination (RFE) and Random Forest importance selected 30 regions before exercise and 33 regions after exercise that differentiated CFS from GWI across all models, and produced the ultimate best accuracies of 82% before exercise and 82% after exercise by Logistic Regression or Decision Tree by a single model, and 100% before and after exercise when selected by any six or more models. Differential activation on both days included the right anterior insula, left putamen, and bilateral orbital frontal, ventrolateral prefrontal cortex, superior, inferior, and precuneus (medial) parietal, and lateral temporal regions. Day 2 had the cerebellum, left supplementary motor area and bilateral pre- and post-central gyri. Changes between days included the right Rolandic operculum switching to the left on Day 2, and the bilateral midcingulum switching to the left anterior cingulum. Conclusion: We concluded that CFS and GWI are significantly differentiable using a pattern of fMRI activity based on an ensemble machine learning model.

Read full abstract

Random Forest Importance Research Articles

Articles published on Random Forest Importance

Finding the combination of multiple biomarkers to diagnose oral squamous cell carcinoma – A data mining approach

Prognostic value of machine learning for acute heart failure

Towards a software defect proneness model: feature selection

Multidimensional sentiment recognition of film and television scene images

Abstract 9495: Prognostic Value of Machine Learning on Clinical Parameters for Cardiac Prognosis in Patinets with Acute Congestive Heart Failure

Abstract 9727: Predictive Value of Machine Learning on Adenosine Stress Single Photon Emission Computed Tomography for Multivessel Coronary Artery Stenosis

Feature Importance of Acute Rejection among Black Kidney Transplant Recipients by Utilizing Random Forest Analysis: An Analysis of the UNOS Database.

Research on Influencing Factors and Classification of Patients With Mild and Severe COVID-19 Symptoms.

Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite

Landslide susceptibility analyses using Random Forest, C4.5, and C5.0 with balanced and unbalanced datasets

The Optimization Model for Reducing RON Loss in Gasoline Refining Process

Ensemble method based architecture using random forest importance to predict employee’s turn over

Inferring mechanisms of response prioritization on social media under information overload

Effect of Genetic Crossing and Nutritional Management on the Mineral Composition of Carcass, Blood, Leather, and Viscera of Sheep.

Predictive modeling for wine authenticity using a machine learning approach

Landslide Susceptibility Mapping Using Ant Colony Optimization Strategy and Deep Belief Network in Jiuzhaigou Region

Body weight, serum albumin and food intolerance were linked to upper gastrointestinal Crohn's disease: a 7-year retrospective analysis.

Artificial Intelligence in Ovarian Cancer Diagnosis.

Machine Learning Detects Pattern of Differences in Functional Magnetic Resonance Imaging (fMRI) Data between Chronic Fatigue Syndrome (CFS) and Gulf War Illness (GWI).

Unbiased variable importance for random forests

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Random Forest Importance Research Articles

Articles published on Random Forest Importance

Finding the combination of multiple biomarkers to diagnose oral squamous cell carcinoma – A data mining approach

Prognostic value of machine learning for acute heart failure

Towards a software defect proneness model: feature selection

Multidimensional sentiment recognition of film and television scene images

Abstract 9495: Prognostic Value of Machine Learning on Clinical Parameters for Cardiac Prognosis in Patinets with Acute Congestive Heart Failure

Abstract 9727: Predictive Value of Machine Learning on Adenosine Stress Single Photon Emission Computed Tomography for Multivessel Coronary Artery Stenosis

Feature Importance of Acute Rejection among Black Kidney Transplant Recipients by Utilizing Random Forest Analysis: An Analysis of the UNOS Database.

Research on Influencing Factors and Classification of Patients With Mild and Severe COVID-19 Symptoms.

Inferring and analyzing gene regulatory networks from multi-factorial expression data: a complete and interactive suite

Landslide susceptibility analyses using Random Forest, C4.5, and C5.0 with balanced and unbalanced datasets

The Optimization Model for Reducing RON Loss in Gasoline Refining Process

Ensemble method based architecture using random forest importance to predict employee’s turn over

Inferring mechanisms of response prioritization on social media under information overload

Effect of Genetic Crossing and Nutritional Management on the Mineral Composition of Carcass, Blood, Leather, and Viscera of Sheep.

Predictive modeling for wine authenticity using a machine learning approach

Landslide Susceptibility Mapping Using Ant Colony Optimization Strategy and Deep Belief Network in Jiuzhaigou Region

Body weight, serum albumin and food intolerance were linked to upper gastrointestinal Crohn's disease: a 7-year retrospective analysis.

Artificial Intelligence in Ovarian Cancer Diagnosis.

Machine Learning Detects Pattern of Differences in Functional Magnetic Resonance Imaging (fMRI) Data between Chronic Fatigue Syndrome (CFS) and Gulf War Illness (GWI).

Unbiased variable importance for random forests