Abstract

Microbial diversity has always presented taxonomic challenges. With the popularity of next-generation sequencing technology, more unculturable bacteria have been sequenced, facilitating the discovery of additional new species and complicated current microbial classification. The major challenge is to assign appropriate taxonomic names. Hence, assessing the consistency between taxonomy and genomic relatedness is critical. We proposed and applied a genome comparison approach to a large-scale survey to investigate the distribution of genomic differences among microorganisms. The approach applies a genome-wide criterion, homologous coverage ratio (HCR), for describing the homology between species. The survey included 7861 microbial genomes that excluded plasmids, and 1220 pairs of genera exhibited ambiguous classification. In this study, we also compared the performance of HCR and average nucleotide identity (ANI). The results indicated that HCR and ANI analyses yield comparable results, but a few examples suggested that HCR has a superior clustering effect. In addition, we used the Genome Taxonomy Database (GTDB), the gold standard for taxonomy, to validate our analysis. The GTDB offers 120 ubiquitous single-copy proteins as marker genes for species classification. We determined that the analysis of the GTDB still results in classification boundary blur between some genera and that the marker gene-based approach has limitations. Although the choice of marker genes has been quite rigorous, the bias of marker gene selection remains unavoidable. Therefore, methods based on genomic alignment should be considered for use for species classification in order to avoid the bias of marker gene selection. On the basis of our observations of microbial diversity, microbial classification should be re-examined using genome-wide comparisons.

Highlights

  • In the higher-level classification of organisms, species taxonomy is consistent with phylogenetic inferences—species with similar genotypes should have similar phenotypes

  • Such a philosophy of species classification is constantly being applied to microorganisms

  • Microbial diversity is attributable to the need of microorganisms to rapidly mutate to adapt to the environment under different environmental conditions [1]

Read more

Summary

Introduction

In the higher-level classification of organisms, species taxonomy is consistent with phylogenetic inferences—species with similar genotypes should have similar phenotypes. Such a philosophy of species classification is constantly being applied to microorganisms. Whole-genome alignment is another method of microbial classification, and the terminology of the overall genome association index (OGRI) was proposed in 2014 [7]. Distances in homologous relationships are evaluated on the basis of the results of whole-genome comparisons, and this eliminates the bias that can be caused by the presence of multiple alignments and the selection of different marker gene combinations. We used the sequence alignment tool to perform a more precise genomic comparison and ascertain the evidence of ambiguous microbial classification

Data Description
ANI and Other Tools
Large-Scale Genome Comparison
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call