Abstract

The CRISPR-Cas system biologically serves as an adaptive defense mechanism against phages. However, there is growing interest in exploiting the hypervariable nature of the CRISPR locus, often of viral origin, for microbial typing and tracking. Moreover, the spacer content of any given strain provides a phage resistance profile. Large-scale CRISPR typing studies require an efficient method for showcasing CRISPR array similarities across multiple isolates. Historically, CRISPR arrays found in microbes have been represented by colored shapes based on nucleotide sequence identity and, while this approach is now routinely used, only scarce computational resources are available to automate the process, making it very time-consuming for large datasets. To alleviate this tedious task, we introduce CRISPRStudio, a command-line tool developed to accelerate CRISPR analysis and standardize the preparation of CRISPR array figures. It first compares nucleotide spacer sequences present in a dataset and then clusters them based on sequence similarity to assign a meaningful representative color. CRISPRStudio offers versatility to suit different biological contexts by including options such as automatic sorting of CRISPR loci and highlighting of shared spacers, while remaining fast and user-friendly.

Highlights

  • Of all the applications and technologies developed around CRISPR-Cas systems, strain typing is arguably the oldest one, as it capitalizes on the ability of the system to acquire new DNA fragments [1]

  • Discovered as an unusual nucleotide arrangement in the Escherichia coli genome [2,3] and later as a defense mechanism [4], CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci, together with their cas (CRISPR-associated) genes, form the so-called CRISPR-Cas systems. These microbial systems provide resistance against invasion of foreign DNA elements, such as phage genomes and plasmids [4,5], by first incorporating short DNA sequences called spacers [6] which originate from the foreign DNA element, into the CRISPR array

  • In the case of phage infections, the CRISPR-Cas machinery will use these spacers of viral origin as a memory to produce crRNAs that will target subsequent infections by phage genomes carrying the exact spacer sequence

Read more

Summary

Introduction

Of all the applications and technologies developed around CRISPR-Cas systems, strain typing is arguably the oldest one, as it capitalizes on the ability of the system to acquire new DNA fragments [1]. For reporting purposes, the CRISPR arrays are graphically represented with colored shapes, making it easy to visualize similarities and differences across microbial strains This representation is commonly used to facilitate comparison and analyses [17,18,19,20,21,22,23,24,25,26,27,28,29], the development of software to support such analyses remained unexplored until recently when the first automated visualization program, called CRISPRviz, was published [30]. The color-coded visualization is an intuitive method to identify strains with the same phage infection profile and conserved spacers among evolutionary distant microbial species, suggestive of infections by broad host range phages

Materials and Methods
CRISPR Mining and Spacer Extraction
Spacers Comparison and Conversion to HEX Color Code
Default CRISPR Array Clustering
Visualization in a Vector Graphics Editor
Performance and Additional Feature Demonstrations
Speed Evaluation
Color-Coded Improved Visualization
Comparison of color-codingsystems systemsof of CRISPRviz
Customized Sorting and Editing
Manual Inspection
Appending
Graying Out Spacers
Optional
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.