Multimodal AI/ML for discovering novel biomarkers and predicting disease using multi-omics profiles of patients with cardiovascular diseases

William Degroat, Habiba Abdelhalim, Elizabeth Peker, Neev Sheth, Rishabh Narayanan, Saman Zeeshan, Bruce T Liang, Zeeshan Ahmed

doi:10.1038/s41598-024-78553-6

Abstract

Cardiovascular diseases (CVDs) are complex, multifactorial conditions that require personalized assessment and treatment. Advancements in multi-omics technologies, namely RNA sequencing and whole-genome sequencing, have provided translational researchers with a comprehensive view of the human genome. The efficient synthesis and analysis of this data through an integrated approach that characterizes genetic variants alongside expression patterns linked to emerging phenotypes can reveal novel biomarkers and enable the segmentation of patient populations based on personalized risk factors. In this study, we present a cutting-edge methodology rooted in the integration of traditional bioinformatics, classical statistics, and multimodal machine learning techniques. Our approach has the potential to uncover the intricate mechanisms underlying CVD, enabling patient-specific risk and response profiling. We sourced transcriptomic expression data and single nucleotide polymorphisms (SNPs) from both CVD patients and healthy controls. By integrating these multi-omics datasets with clinical demographic information, we generated patient-specific profiles. Utilizing a robust feature selection approach, we identified a signature of 27 transcriptomic features and SNPs that are effective predictors of CVD. Differential expression analysis, combined with minimum redundancy maximum relevance feature selection, highlighted biomarkers that explain the disease phenotype. This approach prioritizes both biological relevance and efficiency in machine learning. We employed Combined Annotation Dependent Depletion (CADD) scores and allele frequencies to identify variants with pathogenic characteristics in CVD patients. Classification models trained on this signature demonstrated high-accuracy predictions for CVD. The best performing of these models was an XGBoost classifier optimized via Bayesian hyperparameter tuning, which was able to correctly classify all patients in our test dataset.
Using SHapley Additive exPlanations, we created risk assessments for patients, offering further contextualization of these predictions in a clinical setting. Across the cohort, RPL36AP37 and HBA1 were scored as the most important biomarkers for predicting CVDs. A comprehensive literature review revealed that a substantial portion of the diagnostic biomarkers identified have previously been associated with CVD. The framework we propose in this study is unbiased and generalizable to other diseases and disorders.
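
As an illustration of the minimum redundancy maximum relevance (mRMR) idea mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes the "difference" form of the mRMR criterion and uses absolute Pearson correlation as a simple stand-in for both relevance and redundancy; the abstract does not state which variant or scoring measure was used. The feature names and toy values are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def mrmr_select(features, labels, k):
    """Greedy mRMR, difference form (an assumed variant).

    At each step, pick the feature with the highest
    relevance(feature, labels) - mean redundancy(feature, already selected).
    """
    relevance = {name: abs(pearson(col, labels)) for name, col in features.items()}
    selected = []
    remaining = set(features)
    while remaining and len(selected) < k:

        def score(name):
            if not selected:
                return relevance[name]
            redundancy = sum(
                abs(pearson(features[name], features[s])) for s in selected
            ) / len(selected)
            return relevance[name] - redundancy

        # sorted() makes tie-breaking deterministic (alphabetical order).
        best = max(sorted(remaining), key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical toy data: 4 controls (0) followed by 4 cases (1).
labels = [0, 0, 0, 0, 1, 1, 1, 1]
features = {
    "gene_a": [1, 2, 1, 2, 4, 5, 4, 5],      # strongly tracks the label
    "gene_a_dup": [1, 2, 1, 3, 3, 5, 4, 5],  # relevant, but nearly a copy of gene_a
    "gene_b": [3, 1, 2, 2, 3, 2, 4, 3],      # moderately relevant, less redundant
    "noise": [2, 3, 3, 2, 3, 2, 2, 3],       # uncorrelated with the label
}

selected = mrmr_select(features, labels, k=2)
print(selected)
```

Ranking by relevance alone would choose gene_a and gene_a_dup here; the redundancy penalty is what lets the less correlated gene_b take the second slot instead.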

Journal: Scientific Reports
Publication Date: Nov 3, 2024
License type: CC BY-NC-ND 4.0

Similar Papers

On some aspects of minimum redundancy maximum relevance feature selection
Peter Bugata ... Peter Drotar
Science China Information Sciences | VOL. 63
24 Dec 2019

Detection of lung cancer on chest CT images using minimum redundancy maximum relevance feature selection method with convolutional neural networks
Mesut Toğaçar ... Zafer Cömert
Biocybernetics and Biomedical Engineering | VOL. 40
23 Nov 2019

Machine Learning Based Approaches for Prediction of Parkinson's Disease
Arvind Kumar Tiwari
Machine Learning and Applications: An International Journal | VOL. 3
30 Jun 2016

Prevention of Cardiovascular Disease in Persons with Type 2 Diabetes Mellitus: Current Knowledge and Rationale for the Action to Control Cardiovascular Risk in Diabetes (ACCORD) Trial
David C Goff ... Denise G Simons-Morton
The American Journal of Cardiology | VOL. 99
12 Apr 2007
