Wrapper Feature Selection Method Research Articles

As climate change intensifies, the frequency and severity of waterlogging are expected to increase, necessitating a deeper understanding of the cucumber response to this stress. In this study, three public RNA-seq datasets (PRJNA799460, PRJNA844418, and PRJNA678740) comprising 36 samples were analyzed. Various feature selection algorithms including Uncertainty, Relief, SVM (Support Vector Machine), Correlation, and logistic least absolute shrinkage, and selection operator (LASSO) were performed to identify the most significant genes related to the waterlogging stress response. These feature selection techniques, which have different characteristics, were used to reduce the complexity of the data and thereby identify the most significant genes related to the waterlogging stress response. Uncertainty, Relief, SVM, Correlation, and LASSO identified 4, 4, 10, 21, and 13 genes, respectively. Differential gene correlation analysis (DGCA) focusing on the 36 selected genes identified changes in correlation patterns between the selected genes under waterlogged versus control conditions, providing deeper insights into the regulatory networks and interactions among the selected genes. DGCA revealed significant changes in the correlation of 13 genes between control and waterlogging conditions. Finally, we validated 13 genes using the Random Forest (RF) classifier, which achieved 100% accuracy and a 1.0 Area Under the Curve (AUC) score. The SHapley Additive exPlanations (SHAP) values clearly showed the significant impact of LOC101209599, LOC101217277, and LOC101216320 on the model’s predictive power. In addition, we employed the Boruta as a wrapper feature selection method to further validate our gene selection strategy. Eight of the 13 genes were common across the four feature weighting algorithms, LASSO, DGCA, and Boruta, underscoring the robustness and reliability of our gene selection strategy. Notably, the genes LOC101209599, LOC101217277, and LOC101216320 were among genes identified by multiple feature selection methods from different categories (filtering, wrapper, and embedded). Pathways associated with these specific genes play a pivotal role in regulating stress tolerance, root development, nutrient absorption, sugar metabolism, gene expression, protein degradation, and calcium signaling. These intricate regulatory mechanisms are crucial for cucumbers to adapt effectively to waterlogging conditions. These findings provide valuable insights for uncovering targets in breeding new cucumber varieties with enhanced stress tolerance.

Read full abstract

Percutaneous endoscopic lumbar discectomy (PELD) is one of the main means of minimally invasive spinal surgery, and is an effective means of treating lumbar disc herniation, but its early recurrence is still difficult to predict. With the development of machine learning technology, the auxiliary model based on the prediction of early recurrent lumbar disc herniation (rLDH) and the identification of causative risk factors have become urgent problems in current research. However, the screening ability of current models for key factors affecting the prediction of rLDH, as well as their predictive ability, needs to be improved. Therefore, this paper presents a classification model that utilizes wrapper feature selection, developed through the integration of an enhanced bat algorithm (BDGBA) and support vector machine (SVM). Among them, BDGBA increases the population diversity and improves the population quality by introducing directional mutation strategy and guidance-based strategy, which in turn allows the model to secure better subsets of features. Furthermore, SVM serves as the classifier for the wrapper feature selection method, with its classification prediction results acting as a fitness function for the feature subset. In the proposed prediction method, BDGBA is used as an optimizer for feature subset filtering and as an objective function for feature subset evaluation based on the classification results of the support vector machine, which improves the interpretability and prediction accuracy of the model. In order to verify the performance of the proposed method, this paper proves the performance of the model through global optimization experiments and prediction experiments on real data sets. The accuracy of the proposed rLDH prediction model is 93.49% and sensitivity is 88.33%. The experimental results show that Level of herniated disk, Modic change, Disk height, Disk length, and Disk width are the key factors for predicting rLDH, and the proposed method is an effective auxiliary diagnosis method.

Read full abstract

Wrapper Feature Selection Method Research Articles

Related Topics

Articles published on Wrapper Feature Selection Method

LLpowershap: logistic loss-based automated Shapley values feature selection method

IMOABC: An efficient multi-objective filter–wrapper hybrid approach for high-dimensional feature selection

A wrapper feature selection approach using Markov blankets

Fault identification of rolling bearing based on improved salp swarm algorithm

A Machine Learning-Based Wrapper Method for Feature Selection

Optimizing Fall Risk Diagnosis in Older Adults Using a Bayesian Classifier and Simulated Annealing.

Uncovering waterlogging-responsive genes in cucumber through machine learning and differential gene correlation analysis

Financial distress prediction using an improved particle swarm optimization wrapper feature selection method and tree boosting ensemble

Advancing XSS Detection in IoT over 5G: A Cutting-Edge Artificial Neural Network Approach

Optimizing prediction accuracy for early recurrent lumbar disc herniation with a directional mutation-guided SVM model

Deep Curious Feature Selection: A Recurrent, Intrinsic-Reward Reinforcement Learning Approach to Feature Selection

Hybrid wrapper feature selection method based on genetic algorithm and extreme learning machine for intrusion detection

Embedded Feature Selection Approach Using Penalized Logistic Regression for Universal Steganalysis

Machine learning model for predicting immediate postoperative desaturation using spirometry signal data

Hybrid distributed feature selection using particle swarm optimization-mutual information

Improved feature selection using a hybrid side-blotched lizard algorithm and genetic algorithm approach

End-to-end lightweight berry number prediction for supporting table grape cultivation

Filter and Wrapper Stacking Ensemble (FWSE): a robust approach for reliable biomarker discovery in high-dimensional omics data.

Prediction of Pavement Overall Condition Index Based on Wrapper Feature-Selection Techniques Using Municipal Pavement Data

An Empirical Evaluation of Feature Selection Stability and Classification Accuracy

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Wrapper Feature Selection Method Research Articles

Related Topics

Articles published on Wrapper Feature Selection Method

LLpowershap: logistic loss-based automated Shapley values feature selection method

IMOABC: An efficient multi-objective filter–wrapper hybrid approach for high-dimensional feature selection

A wrapper feature selection approach using Markov blankets

Fault identification of rolling bearing based on improved salp swarm algorithm

A Machine Learning-Based Wrapper Method for Feature Selection

Optimizing Fall Risk Diagnosis in Older Adults Using a Bayesian Classifier and Simulated Annealing.

Uncovering waterlogging-responsive genes in cucumber through machine learning and differential gene correlation analysis

Financial distress prediction using an improved particle swarm optimization wrapper feature selection method and tree boosting ensemble

Advancing XSS Detection in IoT over 5G: A Cutting-Edge Artificial Neural Network Approach

Optimizing prediction accuracy for early recurrent lumbar disc herniation with a directional mutation-guided SVM model

Deep Curious Feature Selection: A Recurrent, Intrinsic-Reward Reinforcement Learning Approach to Feature Selection

Hybrid wrapper feature selection method based on genetic algorithm and extreme learning machine for intrusion detection

Embedded Feature Selection Approach Using Penalized Logistic Regression for Universal Steganalysis

Machine learning model for predicting immediate postoperative desaturation using spirometry signal data

Hybrid distributed feature selection using particle swarm optimization-mutual information

Improved feature selection using a hybrid side-blotched lizard algorithm and genetic algorithm approach

End-to-end lightweight berry number prediction for supporting table grape cultivation

Filter and Wrapper Stacking Ensemble (FWSE): a robust approach for reliable biomarker discovery in high-dimensional omics data.

Prediction of Pavement Overall Condition Index Based on Wrapper Feature-Selection Techniques Using Municipal Pavement Data

An Empirical Evaluation of Feature Selection Stability and Classification Accuracy