Abstract

BackgroundDeep sequencing of targeted genomic regions is becoming a common tool for understanding the dynamics and complexity of Plasmodium infections, but its lower limit of detection is currently unknown. Here, a new amplicon analysis tool, the Parallel Amplicon Sequencing Error Correction (PASEC) pipeline, is used to evaluate the performance of amplicon sequencing on low-density Plasmodium DNA samples. Illumina-based sequencing of two Plasmodium falciparum genomic regions (CSP and SERA2) was performed on two types of samples: in vitro DNA mixtures mimicking low-density infections (1–200 genomes/μl) and extracted blood spots from a combination of symptomatic and asymptomatic individuals (44–653,080 parasites/μl). Three additional analysis tools—DADA2, HaplotypR, and SeekDeep—were applied to both datasets and the precision and sensitivity of each tool were evaluated.ResultsAmplicon sequencing can contend with low-density samples, showing reasonable detection accuracy down to a concentration of 5 Plasmodium genomes/μl. Due to increased stochasticity and background noise, however, all four tools showed reduced sensitivity and precision on samples with very low parasitaemia (< 5 copies/μl) or low read count (< 100 reads per amplicon). PASEC could distinguish major from minor haplotypes with an accuracy of 90% in samples with at least 30 Plasmodium genomes/μl, but only 61% at low Plasmodium concentrations (< 5 genomes/μl) and 46% at very low read counts (< 25 reads per amplicon). The four tools were additionally used on a panel of extracted parasite-positive blood spots from natural malaria infections. While all four identified concordant patterns of complexity of infection (COI) across four sub-Saharan African countries, COI values obtained for individual samples differed in some cases.ConclusionsAmplicon deep sequencing can be used to determine the complexity and diversity of low-density Plasmodium infections. Despite differences in their approach, four state-of-the-art tools resolved known haplotype mixtures with similar sensitivity and precision. Researchers can therefore choose from multiple robust approaches for analysing amplicon data, however, error filtration approaches should not be uniformly applied across samples of varying parasitaemia. Samples with very low parasitaemia and very low read count have higher false positive rates and call for read count thresholds that are higher than current default recommendations.

Highlights

  • Deep sequencing of targeted genomic regions is becoming a common tool for understanding the dynamics and complexity of Plasmodium infections, but its lower limit of detection is currently unknown

  • The results show that low parasitaemia does not preclude amplicon analysis of P. falciparum samples, researchers should expect reduced sensitivity and reduced precision with low read-count samples (< 100 reads/amplicon) and at parasite DNA concentrations under 5 genomes/μl

  • Sequencing coverage for low‐density mock infections and natural infections from sub‐Saharan Africa In total, 148 DNA mixtures of known haplotypic composition, 190 blood samples from sub-Saharan Africa, 12 positive-control plasmid samples, and 4 negative-control samples without Plasmodium DNA were PCR amplified for CSP and SERA2 and sequenced on a single Illumina MiSeq run

Read more

Summary

Introduction

Deep sequencing of targeted genomic regions is becoming a common tool for understanding the dynamics and complexity of Plasmodium infections, but its lower limit of detection is currently unknown. When targeting a highly polymorphic genomic region, a single amplicon can distinguish among hundreds of unique haplotypes [1], providing higher resolution than either SNP-based or length-based genotyping approaches This improves estimates of the number of lineages within polyclonal infections (or complexity of infection; COI) [2,3,4], permits the discovery of unknown alleles [5,6,7], and provides increased information for haplotype-based analyses of epistasis and linkage disequilibrium [8]. Amplicon sequencing has high sensitivity for the detection of minority parasite lineages within an infection, and is of particular interest in longitudinal studies that track intra-host dynamics [3, 4]

Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.