Predictive genetic panel for adult asthma using machine learning methods

Luciano Gama Da Silva Gomes,Monica Campbell,Camila Alexandrina Figueiredo,Maria Borges Rabêlo De Santana,Adelmir De Souza Machado,Carolina Barbosa Souza Santos,Gabriela Pimentel Pinheiro,Meher Preethi Boorgula,Cinthia Vila Nova Santana,Kathleen C Barnes,Ryan Dos Santos Costa,Rafael Valente Veiga,Álvaro Augusto Souza Da Cruz Filho

doi:10.1016/j.jacig.2024.100282

Luciano Gama Da Silva Gomes, Monica Campbell + Show 11 more

Open Access

https://doi.org/10.1016/j.jacig.2024.100282

Copy DOI

Abstract

Asthma is a chronic inflammatory disease of the airways that is heterogeneous and multifactorial, making its accurate characterization a complex process. Therefore, identifying the genetic variations associated with asthma and discovering the molecular interactions between the omics that confer risk of developing this disease will help us to unravel the biological pathways involved in its pathogenesis. We sought to develop a predictive genetic panel for asthma using machine learning methods. We tested 3 variable selection methods: Boruta's algorithm, the top 200 genome-wide association study markers according to their respective P values, and an elastic net regression. Ten different algorithms were chosen for the classification tests. Apredictive panel was built on the basis of joint scores between the classification algorithms. Two variable selection methods, Boruta and genome-wide association studies, were statistically similar in terms of the average accuracies generated, whereas elastic net had the worst overall performance. The predictive genetic panel was completed with 155 single-nucleotide variants, with 91.18% accuracy, 92.75% sensitivity, and 89.55% specificity using the support vector machine algorithm. The markers used range from known single-nucleotide variants to those not previously described in the literature. Our study shows potential in creating genetic prediction panels with tailored penalties per marker, aiding in the identification of optimal machine learning methods for intricate results. This method is able to classify asthma and nonasthma effectively, proving its potential utility in clinical prediction and diagnosis.

Full Text