Abstract

Recently, the diversity of the Mycobacterium tuberculosis complex (MTBC) population structure has been described in detail. Based on geographical separation and specific host pathogen co-evolution shaping MTBC virulence traits, at least 20 major lineages/genotypes have evolved finally leading to a clear influence of strain genetic background on transmissibility, clinical presentation/outcome, and resistance development. Therefore, high resolution genotyping for characterization of strains in larger studies is mandatory for understanding mechanisms of host-pathogen-interaction and to improve tuberculosis (TB) control. Single nucleotide polymorphisms (SNPs) represent the most reliable markers for lineage classification of clinical isolates due to the low levels of homoplasy, however their use is hampered either by low discriminatory power or by the need to analyze a large number of genes to achieve higher resolution. Therefore, we carried out de novo sequencing of 26 genes (approx. 20000 bp per strain) in a reference collection of MTBC strains including all major genotypes to define a highly discriminatory gene set. Overall, 161 polymorphisms were detected of which 59 are genotype-specific, while 13 define deeper branches such as the Euro-American lineage. Unbiased investigation of the most variable set of 11 genes in a population based strain collection (one year, city of Hamburg, Germany) confirmed the validity of SNP analysis as all strains were classified with high accuracy. Taken together, we defined a diagnostic algorithm which allows the identification of 17 MTBC phylogenetic lineages with high confidence for the first time by sequencing analysis of just five genes. In conclusion, the diagnostic algorithm developed in our study is likely to open the door for a low cost high resolution sequence/SNP based differentiation of the MTBC with a very high specificity. High throughput assays can be established which will be needed for large association studies that are mandatory for detailed investigation of host-pathogen-interaction during TB infection.

Highlights

  • With one-third of the world’s population infected and approximately 2 million individuals dying of tuberculosis (TB) annually, bacteria of the Mycobacterium tuberculosis complex (MTBC) are among the most harmful pathogens [1]

  • Bacterial Strains Sequence analysis of 26 genes (Table 1) was carried out in 65 clinical isolates (Table 2) of the mycobacterial interspersed repetitive units (MIRU)-VNTRplus reference collection [14,19] as well as the ATCC reference strains H37Rv (ATCC 27294), M. africanum (ATCC 25420), and M. bovis (ATCC 19210). 21 different species/genotypes were represented by three clinical isolates

  • All strains were previously classified into phylogenetic lineages based on spoligotyping, MIRU-variable number of tandem repeats (VNTR) and deletion typing [19]

Read more

Summary

Introduction

With one-third of the world’s population infected and approximately 2 million individuals dying of tuberculosis (TB) annually, bacteria of the Mycobacterium tuberculosis complex (MTBC) are among the most harmful pathogens [1]. The interaction of M. tuberculosis with HIV and the emergence of multidrug-resistant strains are further accelerating the TB epidemic [2]. The MTBC comprises seven closely related species with distinct host tropism: M. tuberculosis, M. africanum and M. canettii strains are obligate human pathogens, while M. bovis (bovine), M. caprae (goats), M. microti (rodents) and M. pinnipedii (seals) are considered as animal adapted organism [4,5,6]. Recent studies demonstrate that these species can be subdivided in manifold phylogenetic lineages that show significant levels of functional genome variation and pathobiological characteristics [5,6,7,8]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call