Abstract

CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci, together with cas (CRISPR–associated) genes, form the CRISPR/Cas adaptive immune system, a primary defense strategy that eubacteria and archaea mobilize against foreign nucleic acids, including phages and conjugative plasmids. Short spacer sequences separated by the repeats are derived from foreign DNA and direct interference to future infections. The availability of hundreds of shotgun metagenomic datasets from the Human Microbiome Project (HMP) enables us to explore the distribution and diversity of known CRISPRs in human-associated microbial communities and to discover new CRISPRs. We propose a targeted assembly strategy to reconstruct CRISPR arrays, which whole-metagenome assemblies fail to identify. For each known CRISPR type (identified from reference genomes), we use its direct repeat consensus sequence to recruit reads from each HMP dataset and then assemble the recruited reads into CRISPR loci; the unique spacer sequences can then be extracted for analysis. We also identified novel CRISPRs or new CRISPR variants in contigs from whole-metagenome assemblies and used targeted assembly to more comprehensively identify these CRISPRs across samples. We observed that the distributions of CRISPRs (including 64 known and 86 novel ones) are largely body-site specific. We provide detailed analysis of several CRISPR loci, including novel CRISPRs. For example, known streptococcal CRISPRs were identified in most oral microbiomes, totaling ∼8,000 unique spacers: samples resampled from the same individual and oral site shared the most spacers; different oral sites from the same individual shared significantly fewer, while different individuals had almost no common spacers, indicating the impact of subtle niche differences on the evolution of CRISPR defenses. We further demonstrate potential applications of CRISPRs to the tracing of rare species and the virus exposure of individuals. This work indicates the importance of effective identification and characterization of CRISPR loci to the study of the dynamic ecology of microbiomes.

Highlights

  • CRISPRs, together with cas genes (CRISPR-associated genes), provide acquired resistance against viruses and conjugative plasmids [1,2], and are found in most archaeal (,90%) and bacterial (,40%) genomes [3,4,5]

  • The Human Microbiome Project (HMP) has resulted in .700 datasets of shotgun metagenomic sequences, from which we can learn about the compositions and functions of human-associated microbial communities

  • CRISPR/Cas systems are a widespread class of adaptive immune systems in bacteria and archaea, providing acquired immunity against foreign nucleic acids: CRISPR/Cas defense pathways involve integration of viral- or plasmid-derived DNA segments into CRISPR arrays, and expression of short CRISPR RNAs (crRNAs) from these single repeat-spacer units, to generate interference to future invading foreign genomes

Read more

Summary

Introduction

CRISPRs, together with cas genes (CRISPR-associated genes), provide acquired resistance against viruses and conjugative plasmids [1,2], and are found in most archaeal (,90%) and bacterial (,40%) genomes [3,4,5]. CRISPR arrays consist of 24– 47 bp direct repeats, separated by unique sequences (spacers) that are acquired from viral or plasmid genomes [6]. CRISPR/Cas systems commonly use repeat and spacer-derived short guide CRISPR RNAs (crRNAs) to silence foreign nucleic acids in a sequence-specific manner [8,9]. CRISPR/Cas defense pathways involve several steps, including integration of viral or plasmid DNA-derived spacers into the CRISPR array, expression of short crRNAs consisting of unique single repeat-spacer units, and interference with invading foreign genomes at both the DNA and RNA levels, by mechanisms that are not yet fully understood [8,10]. The diversity of cas genes suggests that multiple pathways have arisen to use the basic information contained in the repeat-spacer units in diverse defense mechanisms. The CRISPR components are evolutionarily closely linked and potentially evolve simultaneously as an intact locus— sequence analysis reveals that the direct repeats in CRISPR locus and the linked cas genes co-evolve under analogous evolutionary pressure [11]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call