Abstract

Malaria or Paludism is a tropical disease caused by parasites of the Plasmodium genre and transmitted to humans through the bite of infected mosquitos of the Anopheles genre. This pathology is considered one of the first causes of death in tropical countries and, despite several existing therapies, they have a high toxicity. Computational methods based on Quantitative Structure- Activity Relationship studies have been widely used in drug design work flows. The main goal of the current research is to develop computational models for the identification of antimalarial hit compounds. For this, a data set suitable for the modeling of the antimalarial activity of chemical compounds was compiled from the literature and subjected to a thorough curation process. In addition, the performance of a diverse set of ensemble-based classification methodologies was evaluated and one of these ensembles was selected as the most suitable for the identification of antimalarial hits based on its virtual screening performance. Data curation was conducted to minimize noise. Among the explored ensemble-based methods, the one combining Genetic Algorithms for the selection of the base classifiers and Majority Vote for their aggregation showed the best performance. Our results also show that ensemble modeling is an effective strategy for the QSAR modeling of highly heterogeneous datasets in the discovery of potential antimalarial compounds. It was determined that the best performing ensembles were those that use Genetic Algorithms as a method of selection of base models and Majority Vote as the aggregation method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.