Incremental Feature Selection Research Articles

Viral infections significantly impact the immune system, and impact will persist until recovery. However, the influence of severe acute respiratory syndrome coronavirus 2 infection on the homeostatic immune status and secondary immune response in recovered patients remains unclear. To investigate these persistent alterations, we employed five feature-ranking algorithms (LASSO, MCFS, RF, CATBoost, and XGBoost), incremental feature selection, synthetic minority oversampling technique and two classification algorithms (decision tree and k-nearest neighbors) to analyze multi-omics data (surface proteins and transcriptome) from coronavirus disease 2019 (COVID-19) recovered patients and healthy controls post-influenza vaccination. The single-cell multi-omics dataset was divided into five subsets corresponding to five immune cell subtypes: B cells, CD4+ T cells, CD8+ T cells, Monocytes, and Natural Killer cells. Each cell was represented by 28,402 scRNA-seq (RNA) features, 3 Hash Tag Oligo (HTO) features, 138 Cellular indexing of transcriptomes and epitopes by sequencing (CITE) features and 23,569 Single Cell Transform (SCT) features. Some multi-omics markers were identified and effective classifiers were constructed. Our findings indicate a distinct immune status in COVID-19 recovered patients, characterized by low expression of ribosomal protein (RPS26) and high expression of immune cell surface proteins (CD33, CD48). Notably, TMEM176B, a membrane protein, was highly expressed in monocytes of COVID-19 convalescent patients. These observations aid in discerning molecular differences among immune cell subtypes and contribute to understanding the prolonged effects of COVID-19 on the immune system, which is valuable for treating infectious diseases like COVID-19.

Read full abstract

Congenital heart disease (CHD) represents a spectrum of inborn heart defects influenced by genetic and environmental factors. This study advances the field by analyzing gene expression profiles in 21,034 cardiac fibroblasts, 73,296 cardiomyocytes, and 35,673 endothelial cells, utilizing single-cell level analysis and machine learning techniques. Six CHD conditions: dilated cardiomyopathy (DCM), donor hearts (used as healthy controls), hypertrophic cardiomyopathy (HCM), heart failure with hypoplastic left heart syndrome (HF_HLHS), Neonatal Hypoplastic Left Heart Syndrome (Neo_HLHS), and Tetralogy of Fallot (TOF), were investigated for each cardiac cell type. Each cell sample was represented by 29,266 gene features. These features were first analyzed by six feature-ranking algorithms, resulting in several feature lists. Then, these lists were fed into incremental feature selection, containing two classification algorithms, to extract essential gene features and classification rules and build efficient classifiers. The identified essential genes can be potential CHD markers in different cardiac cell types. For instance, the LASSO identified key genes specific to various heart cell types in CHD subtypes. FOXO3 was found to be up-regulated in cardiac fibroblasts for both Dilated and hypertrophic cardiomyopathy. In cardiomyocytes, distinct genes such as TMTC1, ART3, ARHGAP24, SHROOM3, and XIST were linked to dilated cardiomyopathy, Neo-Hypoplastic Left Heart Syndrome, hypertrophic cardiomyopathy, HF-Hypoplastic Left Heart Syndrome, and Tetralogy of Fallot, respectively. Endothelial cell analysis further revealed COL25A1, NFIB, and KLF7 as significant genes for dilated cardiomyopathy, hypertrophic cardiomyopathy, and Tetralogy of Fallot. LightGBM, Catboost, MCFS, RF, and XGBoost further delineated key genes for specific CHD subtypes, demonstrating the efficacy of machine learning in identifying CHD-specific genes. Additionally, this study developed quantitative rules for representing the gene expression patterns related to CHDs. This research underscores the potential of machine learning in unraveling the molecular complexities of CHD and establishes a foundation for future mechanism-based studies.

Read full abstract

Incremental Feature Selection Research Articles

Related Topics

Articles published on Incremental Feature Selection

Machine Learning for Prediction of Resistance Scores in Wheat (Triticum aestivum L.)

Identification of Key Genes in Fetal Gut Development at Single-Cell Level by Exploiting Machine Learning Techniques.

Identification of gene and protein signatures associated with long-term effects of COVID-19 on the immune system after patient recovery by analyzing single-cell multi-omics data using a machine learning approach

Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types.

Matrix-based incremental feature selection method using weight-partitioned multigranulation rough set

Detecting key genes relative expression orderings as biomarkers for machine learning-based intelligent screening and analysis of type 2 diabetes mellitus

Consistency approximation: Incremental feature selection based on fuzzy rough set theory

Cost-effective genomic prediction of critical economic traits in sturgeons through low-coverage sequencing

Identification of RNA‐dependent liquid‐liquid phase separation proteins using an artificial intelligence strategy

Rough set Theory-Based group incremental approach to feature selection

Machine Learning Reveals Impacts of Smoking on Gene Profiles of Different Cell Types in Lung.

Incremental feature selection approach to multi-dimensional variation based on matrix dominance conditional entropy for ordered data set

Excavation of gene markers associated with pancreatic ductal adenocarcinoma based on interrelationships of gene expression.

Identifying Flare-indicative Photospheric Magnetic Field Parameters from Multivariate Time-series Data of Solar Active Regions

Incremental feature selection for large-scale hierarchical classification with the arrival of new samples

Exploring Prognostic Gene Factors in Breast Cancer via Machine Learning.

Deep-STP: a deep learning-based approach to predict snake toxin proteins by using word embeddings.

Predicting Critical Path of Labor Dispute Resolution in Legal Domain by Machine Learning Models Based on SHapley Additive exPlanations and Soft Voting Strategy

Promoter Prediction in Agrobacterium tumefaciens Strain C58 by Using Artificial Intelligence Strategies.

Identification of key genes associated with persistent immune changes and secondary immune activation responses induced by influenza vaccination after COVID-19 recovery by machine learning methods

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Incremental Feature Selection Research Articles

Related Topics

Articles published on Incremental Feature Selection

Machine Learning for Prediction of Resistance Scores in Wheat (Triticum aestivum L.)

Identification of Key Genes in Fetal Gut Development at Single-Cell Level by Exploiting Machine Learning Techniques.

Identification of gene and protein signatures associated with long-term effects of COVID-19 on the immune system after patient recovery by analyzing single-cell multi-omics data using a machine learning approach

Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types.

Matrix-based incremental feature selection method using weight-partitioned multigranulation rough set

Detecting key genes relative expression orderings as biomarkers for machine learning-based intelligent screening and analysis of type 2 diabetes mellitus

Consistency approximation: Incremental feature selection based on fuzzy rough set theory

Cost-effective genomic prediction of critical economic traits in sturgeons through low-coverage sequencing

Identification of RNA‐dependent liquid‐liquid phase separation proteins using an artificial intelligence strategy

Rough set Theory-Based group incremental approach to feature selection

Machine Learning Reveals Impacts of Smoking on Gene Profiles of Different Cell Types in Lung.

Incremental feature selection approach to multi-dimensional variation based on matrix dominance conditional entropy for ordered data set

Excavation of gene markers associated with pancreatic ductal adenocarcinoma based on interrelationships of gene expression.

Identifying Flare-indicative Photospheric Magnetic Field Parameters from Multivariate Time-series Data of Solar Active Regions

Incremental feature selection for large-scale hierarchical classification with the arrival of new samples

Exploring Prognostic Gene Factors in Breast Cancer via Machine Learning.

Deep-STP: a deep learning-based approach to predict snake toxin proteins by using word embeddings.

Predicting Critical Path of Labor Dispute Resolution in Legal Domain by Machine Learning Models Based on SHapley Additive exPlanations and Soft Voting Strategy

Promoter Prediction in Agrobacterium tumefaciens Strain C58 by Using Artificial Intelligence Strategies.

Identification of key genes associated with persistent immune changes and secondary immune activation responses induced by influenza vaccination after COVID-19 recovery by machine learning methods