Abstract

The high variability of the human immunodeficiency virus (HIV) is an important cause of HIV resistance to reverse transcriptase and protease inhibitors. There are many variants of HIV type 1 (HIV-1) that can be used to model sequence-resistance relationships. Machine learning methods are widely and successfully used in new drug discovery. An emerging body of data regarding the interactions of small drug-like molecules with their protein targets provides the possibility of building models on “structure-property” relationships and analyzing the performance of various machine-learning techniques. In our research, we analyze several different types of descriptors in order to predict the resistance of HIV reverse transcriptase and protease to the marketed antiretroviral drugs using the Random Forest approach. First, we represented amino acid sequences as a set of short peptide fragments, which included several amino acid residues. Second, we represented nucleotide sequences as a set of fragments, which included several nucleotides. We compared these two approaches using open data from the Stanford HIV Drug Resistance Database. We have determined the factors that modulate the performance of prediction: in particular, we observed that the prediction performance was more sensitive to certain drugs than a type of the descriptor used.

Highlights

  • The human immunodeficiency virus (HIV)/AIDS pandemic is one of the most important challenges facing humanity

  • As to the resistance to protease inhibitors (PR) inhibitors, in general, the performance based on the nucleotide descriptors was not higher in comparison to that based on the short peptides; the performance of prediction was more dependent on the drug than on the type of the descriptors

  • We demonstrated the use of the Random Forest approach for the modeling of human immunodeficiency virus type 1 (HIV-1) resistance to reverse transcriptase and protease inhibitors based on the descriptors represented by small peptide and/or nucleotide fragments

Read more

Summary

Introduction

The human immunodeficiency virus (HIV)/AIDS pandemic is one of the most important challenges facing humanity. HIV-1 exhibits high mutation rates and a great ability to recombination These two factors can cause the high level of HIV-1 resistance, which leads to the necessity of HIV treatment with several combinations of antiretroviral drugs. Some of them have the ultimate goal of predicting HIV-1 resistance to reverse transcriptase (RT) and protease inhibitors (PR) [2,3,4,5,6,7,8,9,10,11,12,13] based on amino acid or nucleotide sequences of HIV RT and PR. An application of the Decision Trees methodology was implemented in the Geno2Pheno web-tool [4]

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.