Abstract

The CRISPR-Cas9 system has recently evolved as a powerful mutagenic tool for targeted genome editing. The impeccable functioning of the system depends on the optimal design of single guide RNAs (sgRNAs) that mainly involves sgRNA specificity and on-target cleavage efficacy. Several research groups have designed algorithms and models, trained on mammalian genomes, for predicting sgRNAs cleavage efficacy. These models are also implemented in most plant sgRNA design tools due to the lack of on-target cleavage efficacy studies in plants. However, one of the major drawbacks is that almost all of these models are biased for considering only coding regions of the DNA while excluding ineffective regions, which are of immense importance in functional genomics studies especially for plants, thus making prediction less reliable. In the present study, we evaluate the on-target cleavage efficacy of experimentally validated sgRNAs designed against diverse ineffective regions of Arabidopsis thaliana genome using various statistical tests. We show that nucleotide preference in protospacer adjacent motif (PAM) proximal region, GC content in the PAM proximal seed region, intact RAR and 3rd stem loop structures, and free accessibility of nucleotides in seed and tracrRNA regions of sgRNAs are important determinants associated with their high on-target cleavage efficacy. Thus, our study describes the features important for plant sgRNAs high on-target cleavage efficacy against ineffective genomic regions previously shown to give rise to ineffective sgRNAs. Moreover, it suggests the need of developing an elaborative plant-specific sgRNA design model considering the entire genomic landscape including ineffective regions for enabling highly efficient genome editing without wasting time and experimental resources.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call