Gene Selection Method Research Articles

Abstract Single-cell technologies represent a revolutionary approach to resolving cell-type heterogeneity, identifying cells in specialized states, and detecting rare disease-associated cells. With the cost of single-cell technology decreasing substantially, its integration into clinical studies is gaining momentum. A new computational tool is needed to accommodate different single-cell genomics and clinical data formats while accounting for unwanted confounders. The study aims to develop a tree-based machine learning model to leverage the unprecedented resolution of single-cell multi-omics data for delineating the genomic and phenotypic drivers behind diverse immunotherapy responses. The proposed model is called single-cell analysis of Clinical Tree (scanCT), inspired by the Generalized Unbiased Interaction Detection and Estimation method for unbiased gene and protein feature selection and easy interpretation. The scanCT model learns from the data to select the genomic feature that best splits the cells from distinct clinical responses for each tree node. The confounding factors will be regressors in the nodes but not be used for branch splitting, while gene and protein features of interest will split the tree but not enter the regression model in each node. scanCT is built to be free from the biased selection towards variables of a larger number of categories or values. With tree-pruning and cross-validation, scanCT overcomes the over-fitting issue and enhances model generalization, especially for clinical studies with limited patients. Particularly, scanCT naturally fits the hierarchical cell type relationship and handles marker gene and protein interaction effects efficiently. Our approach was tested on single-cell datasets from B-cell malignancy patients undergoing Chimeric Antigen Receptor (CAR)-T cell therapy. The results from the scanCT are highly interpretable. For instance, each branch is a gene-protein combination profile, and cells are naturally partitioned by clinical association. The linear regressions at each leaf node are the clinical predictions for cells following the splitting criteria. The regression intercept is an average estimation of toxicity (e.g., neurotoxicity) or efficacy after controlling for confounder (e.g., tumor burden). scanCT accommodates categorical or continuous clinical response and survival data and is robust to missing values, a frequent challenge in oncological studies. scanCT represents a significant step forward in single-cell data analysis, which merges complex genotypic and phenotypic information with clinical outcomes. The efficacy and toxicity-associated genomic signatures will inform new manufacturing strategies to optimize CAR-T cell therapy products. The model and clinical association detections are expected to go beyond the B-cell malignancy field to benefit the broader cancer research community. Citation Format: Ye Zheng, Long Nguyen, Peigen Zhou, Alexandre V. Hirayama. ScanCT: A tree-based machine learning model to detect single-cell genomic features associated with clinical outcomes [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2024; Part 1 (Regular Abstracts); 2024 Apr 5-10; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2024;84(6_Suppl):Abstract nr 7352.

Read full abstract

Introduction: Drug response prediction, especially in terms of cell viability prediction, is a well-studied research problem with significant implications for personalized medicine. It enables the identification of the most effective drugs based on individual genetic profiles, aids in selecting potential drug candidates, and helps identify biomarkers that predict drug efficacy and toxicity.A deeper investigation on drug response prediction reveals that drugs exert their effects by targeting specific proteins, which in turn perturb related genes in cascading ways. This perturbation affects cellular pathways and regulatory networks, ultimately influencing the cellular response to the drug. Identifying which genes are perturbed and how they interact can provide critical insights into the mechanisms of drug action. Hence, the problem of predicting drug response can be framed as a dual problem involving both the prediction of drug efficacy and the selection of drug-specific genes. Identifying these drug-specific genes (biomarkers) is crucial because they serve as indicators of how the drug will affect the biological system, thereby facilitating both drug response prediction and biomarker discovery.Methods: In this study, we propose DGDRP (Drug-specific Gene selection for Drug Response Prediction), a graph neural network (GNN)-based model that uses a novel rank-and-re-rank process for drug-specific gene selection. DGDRP first ranks genes using a pathway knowledge-enhanced network propagation algorithm based on drug target information, ensuring biological relevance. It then re-ranks genes based on the similarity between gene and drug target embeddings learned from the GNN, incorporating semantic relationships. Thus, our model adaptively learns to select drug mechanism-associated genes that contribute to drug response prediction. This integrated approach not only improves drug response predictions compared to other gene selection methods but also allows for effective biomarker discovery.Discussion: As a result, our approach demonstrates improved drug response predictions compared to other gene selection methods and demonstrates comparability with state-of-the-art deep learning models. Case studies further support our method by showing alignment of selected gene sets with the mechanisms of action of input drugs.Conclusion: Overall, DGDRP represents a deep learning based re-ranking strategy, offering a robust gene selection framework for more accurate drug response prediction. The source code for DGDRP can be found at: https://github.com/minwoopak/heteronet.

Read full abstract

Gene Selection Method Research Articles

Related Topics

Articles published on Gene Selection Method

Transcriptome-based prediction for polygenic traits in rice using different gene subsets

ScPanel: a tool for automatic identification of sparse gene panels for generalizable patient classification using scRNA-seq datasets.

Nature-inspired computing based Non-Hodgkin lymphoma prediction from microarray expression with GA-RFE gene selection method; an experimental histopathological study

Robust and Effective: A Deep Matrix Factorization Framework for Classification.

Comparative genomics analysis to explore the biodiversity and mining novel target genes of Listeria monocytogenes strains from different regions.

Gene selection based on recursive spider wasp optimizer guided by marine predators algorithm

Identification of oxidative stress-related biomarkers in chronic rhinosinusitis with nasal polyps using WGCNA combined with machine learning algorithms

Advancing forensic-based investigation incorporating slime mould search for gene selection of high-dimensional genetic data

GFLASSO-LR: Logistic Regression with Generalized Fused LASSO for Gene Selection in High-Dimensional Cancer Classification

In Silico Identification of Effective Genes for Acute Leukemia Classification Using a Spline Regression-based Framework

Incorporating machine learning and PPI networks to identify mitochondrial fission-related immune markers in abdominal aortic aneurysms

Abstract 7352: ScanCT: A tree-based machine learning model to detect single-cell genomic features associated with clinical outcomes

Bi-level gene selection of cancer by combining clustering and sparse learning

Enhancing Cancer Classification Through Ensemble Machine Learning and Gene Selection Approaches

A comparison of marker gene selection methods for single-cell RNA sequencing data

Machine Learning Methods for Gene Selection in Uveal Melanoma.

Gene panel selection for targeted spatial transcriptomics

DGDRP: drug-specific gene selection for drug response prediction via re-ranking through propagating and learning biological network.

Integrating gene selection and deep learning for enhanced Autisms' disease prediction: a comparative study using microarray data

Hybrid Gene Selection Methods for High-Dimensional Lung Cancer Data Using Improved Arithmetic Optimization Algorithm

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Gene Selection Method Research Articles

Related Topics

Articles published on Gene Selection Method

Transcriptome-based prediction for polygenic traits in rice using different gene subsets

ScPanel: a tool for automatic identification of sparse gene panels for generalizable patient classification using scRNA-seq datasets.

Nature-inspired computing based Non-Hodgkin lymphoma prediction from microarray expression with GA-RFE gene selection method; an experimental histopathological study

Robust and Effective: A Deep Matrix Factorization Framework for Classification.

Comparative genomics analysis to explore the biodiversity and mining novel target genes of Listeria monocytogenes strains from different regions.

Gene selection based on recursive spider wasp optimizer guided by marine predators algorithm

Identification of oxidative stress-related biomarkers in chronic rhinosinusitis with nasal polyps using WGCNA combined with machine learning algorithms

Advancing forensic-based investigation incorporating slime mould search for gene selection of high-dimensional genetic data

GFLASSO-LR: Logistic Regression with Generalized Fused LASSO for Gene Selection in High-Dimensional Cancer Classification

In Silico Identification of Effective Genes for Acute Leukemia Classification Using a Spline Regression-based Framework

Incorporating machine learning and PPI networks to identify mitochondrial fission-related immune markers in abdominal aortic aneurysms

Abstract 7352: ScanCT: A tree-based machine learning model to detect single-cell genomic features associated with clinical outcomes

Bi-level gene selection of cancer by combining clustering and sparse learning

Enhancing Cancer Classification Through Ensemble Machine Learning and Gene Selection Approaches

A comparison of marker gene selection methods for single-cell RNA sequencing data

Machine Learning Methods for Gene Selection in Uveal Melanoma.

Gene panel selection for targeted spatial transcriptomics

DGDRP: drug-specific gene selection for drug response prediction via re-ranking through propagating and learning biological network.

Integrating gene selection and deep learning for enhanced Autisms' disease prediction: a comparative study using microarray data

Hybrid Gene Selection Methods for High-Dimensional Lung Cancer Data Using Improved Arithmetic Optimization Algorithm