Mutant-Bin: Unsupervised Haplotype Estimation of Viral Population Diversity Without Reference Genome

Shruthi Prabhakara, Raj Acharya, Mary Poss, Raunaq Malhotra

doi:10.1089/cmb.2012.0174

Abstract

High genetic variability in viral populations plays an important role in disease progression, pathogenesis, and drug resistance. The last few years have seen significant progress in the development of methods for reconstructing viral populations using data from next-generation sequencing technologies. These methods identify the differences between individual haplotypes by mapping the short reads to a reference genome. Much less has been published about resolving the population structure when a reference genome is lacking or is not well defined, and this gap severely limits the application of these new technologies to resolving virus population structure. We describe a computational framework, called Mutant-Bin, for clustering individual haplotypes in a viral population and determining their prevalence based on a set of deep sequencing reads. The main advantages of our method are that: (i) it enables determination of the population structure and haplotype frequencies when a reference genome is lacking; (ii) the method is unsupervised, in that the number of haplotypes does not have to be specified in advance; and (iii) it identifies the polymorphic sites that co-occur in a subset of haplotypes and the frequency with which they appear in the viral population. The method was evaluated on simulated reads with sequencing errors and on 454 pyrosequencing reads from HIV samples. Our method clustered a high percentage of haplotypes with low false-positive rates, even at low genetic diversity.
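The abstract does not spell out the algorithm, so the following is only a toy sketch of the general idea of reference-free, unsupervised binning. It is not the Mutant-Bin method. It assumes that subsequences (k-mers) shared by all haplotypes are more abundant in the reads than k-mers spanning a polymorphic site, and that k-mers from the same subset of haplotypes have similar counts. The function names `kmer_counts` and `bin_by_abundance`, the tolerance `rel_tol`, and the toy sequences are all hypothetical.

```python
from collections import Counter


def kmer_counts(reads, k):
    """Count every k-mer across all reads; no reference genome is used."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def bin_by_abundance(counts, rel_tol=0.25):
    """Greedily group k-mers with similar counts (illustrative heuristic only).

    K-mers are visited in order of increasing count. A k-mer joins the
    current bin if its count is within rel_tol of that bin's running mean;
    otherwise a new bin is opened. The number of bins is therefore not
    specified in advance.
    """
    bins = []  # each bin: {"kmers": [...], "total": summed count}
    for kmer, c in sorted(counts.items(), key=lambda kv: (kv[1], kv[0])):
        if bins:
            current = bins[-1]
            mean = current["total"] / len(current["kmers"])
            if abs(c - mean) <= rel_tol * mean:
                current["kmers"].append(kmer)
                current["total"] += c
                continue
        bins.append({"kmers": [kmer], "total": c})
    return [(b["total"] / len(b["kmers"]), b["kmers"]) for b in bins]


# Toy population: two haplotypes that differ at one site, mixed 3:1.
# For simplicity each "read" covers the whole toy haplotype, error-free.
hap_a = "AAACCCGGGT"
hap_b = "AAACCTGGGT"
reads = [hap_a] * 6 + [hap_b] * 2

bins = bin_by_abundance(kmer_counts(reads, k=4))
top = max(mean for mean, _ in bins)
for mean, kmers in bins:
    # Relative abundance of each bin versus the k-mers shared by all haplotypes.
    print(f"{mean / top:.2f}", sorted(kmers))
```

In this toy case the k-mers overlapping the variant site fall into two low-abundance bins, one per haplotype, while the shared k-mers form the most abundant bin. Real reads are short, error-prone, and unevenly sampled, so a fixed tolerance like `rel_tol` would not be adequate in practice.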

Journal: Journal of Computational Biology
Publication Date: Jun 1, 2013
Citations: 6
