Abstract
Gene duplication is a common and powerful mechanism by which cells create new signaling pathways [1,2], but recently duplicated proteins typically must become insulated from each other, and from other paralogs, to prevent unwanted cross-talk [3]. A similar challenge arises when new sensors or synthetic signaling pathways are engineered within cells or transferred between genomes. How easily new pathways can be introduced into cells depends on the density and distribution of paralogous pathways in the sequence space defined by their specificity-determining residues [4,5]. Here, we directly probe how crowded sequence space is by generating novel two-component signaling proteins in Escherichia coli using cell sorting coupled to deep-sequencing to analyze large libraries designed based on coevolution patterns. We produce 58 new insulated pathways, in which functional kinase-substrate pairs have different specificities than the parent proteins, and demonstrate that several new pairs are orthogonal to all 27 paralogous pathways in E. coli. Additionally, we readily identify sets of 6 novel kinase-substrate pairs that are mutually orthogonal to each other, significantly increasing the two-component signaling capacity of E. coli. These results indicate that sequence space is not densely occupied. The relative sparsity of paralogs in sequence space suggests that new, insulated pathways can easily arise during evolution or be designed de novo. We demonstrate the latter by engineering a new signaling pathway in E. coli that responds to a plant cytokinin without cross-talk to extant pathways. Our work also demonstrates how coevolution-guided mutagenesis and sequence-space mapping can be used to design large sets of orthogonal protein-protein interactions.