Abstract

The Cys2His2 zinc finger (ZF) is the most frequently found sequence-specific DNA-binding domain in eukaryotic proteins. The ZF’s modular protein–DNA interface has also served as a platform for genome engineering applications. Despite decades of intense study, a predictive understanding of the DNA-binding specificities of either natural or engineered ZF domains remains elusive. To help fill this gap, we developed an integrated experimental-computational approach to enrich and recover distinct groups of ZFs that bind common targets. To showcase the power of our approach, we built several large ZF libraries and demonstrated their excellent diversity. As proof of principle, we used one of these ZF libraries to select and recover thousands of ZFs that bind several 3-nt targets of interest. We were then able to computationally cluster these recovered ZFs to reveal several distinct classes of proteins, all recovered from a single selection, to bind the same target. Finally, for each target studied, we confirmed that one or more representative ZFs yield the desired specificity. In sum, the described approach enables comprehensive large-scale selection and characterization of ZF specificities and should be a great aid in furthering our understanding of the ZF domain.

Highlights

  • The Cys2His2 zinc finger (ZF) is the most common DNAbinding domain found in metazoans [1,2,3]

  • Zinc finger library builds by polymerase chain reaction (PCR)-driven cassette mutagenesis

  • We recognized that the routine production of large and diverse ZF libraries could greatly expand the number of known protein–DNA interactions mediated by this domain

Read more

Summary

Introduction

The Cys2His zinc finger (ZF) is the most common DNAbinding domain found in metazoans [1,2,3]. 50% of the transcription factors (TFs) in the human genome are thought to use ZFs to recognize their targets [2,3,4], yet characterization of their DNA-binding specificities has proven difficult. Within these factors, a single ZF domain binds 3–4 bases of DNA, whereas proteins often contain arrays of multiple adjacent ZFs (Figure 1). Modular assembly of ZFs has a high failure rate when challenged to activate a cell-based reporter assay, perhaps due to conflicting preferences in overlap nucleotides, indicating that we do not fully understand the dependencies between neighboring fingers that influence functional assembly [13]. ZF-nucleases and recombinases have enabled fine genome editing and have proven function in many model organisms [17,18,19,20,21,22,23]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call