Abstract

The Human Genome Project ultimately aimed to translate DNA sequence into drugs. With the draft in hand, the Molecular Libraries Program set out to prosecute all genome-encoded proteins for drug discovery with automated high-throughput screening (HTS). This ambitious vision remains unfulfilled, even while innovations in sequencing technology have fully democratized access to genome-scale sequencing. Why? While the central dogma of biology allows us to chart the entirety of cellular metabolism through sequencing, there is no direct coding for chemistry. The rules of base pairing that relate DNA gene to RNA transcript and amino acid sequence do not exist for relating small-molecule structure with macromolecular binding partners and subsequently cellular function. Obtaining such relationships genome-wide is unapproachable via state-of-the-art HTS, akin to attempting genome-wide association studies using turn-of-the-millennium Sanger DNA sequencing.

Our laboratory has been engaged in a multipronged technology development campaign to revolutionize molecular screening through miniaturization in pursuit of genome-scale drug discovery capabilities. The compound library was ripe for miniaturization: it clearly needed to become a consumable. We employed DNA-encoded library (DEL) synthesis principles in the development of solid-phase DELs prepared on microscopic beads, each harboring 100 fmol of a single library member and a DNA tag whose sequence describes the structure of the library member. Loading these DEL beads into 100 pL microfluidic droplets followed by online photocleavage, incubation, fluorescence-activated droplet sorting, and DNA sequencing of the sorted DEL beads reveals the chemical structures of bioactive compounds. This scalable library synthesis and screening platform has proven useful in several proof-of-concept projects involving current clinical targets.

Moving forward, we face the problem of druggability and proteome-scale assay development. Developing biochemical or cellular assays for all genome-encoded targets is not scalable and likely impossible as most proteins have ill-defined or unknown activity and may not function outside of their native contexts. These are the dark undruggable expanses, and charting them will require advanced synthesis and analytical technologies that can generalize probe discovery, irrespective of mature protein function, to fulfill the Genome Project's vision of proteome-wide control of cellular pharmacology.
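
As a rough check on the bead and droplet figures quoted above: if a bead carrying 100 fmol of compound released its entire load into a 100 pL droplet, the compound would reach a concentration of about 1 mM. The Python sketch below does that arithmetic. The function name and the `cleaved_fraction` parameter are illustrative assumptions, not part of the described platform, and actual photocleavage yields are not stated here.

```python
FEMTO = 1e-15  # mol per fmol
PICO = 1e-12   # L per pL


def droplet_concentration_molar(loading_mol: float,
                                droplet_volume_l: float,
                                cleaved_fraction: float = 1.0) -> float:
    """Concentration (mol/L) of compound released from one bead into one droplet.

    cleaved_fraction is a hypothetical knob for incomplete photocleavage;
    1.0 means the whole bead loading is released (an upper bound).
    """
    if droplet_volume_l <= 0:
        raise ValueError("droplet volume must be positive")
    if not 0.0 <= cleaved_fraction <= 1.0:
        raise ValueError("cleaved_fraction must lie in [0, 1]")
    return loading_mol * cleaved_fraction / droplet_volume_l


# Figures from the abstract: 100 fmol per bead, 100 pL per droplet.
upper_bound = droplet_concentration_molar(100 * FEMTO, 100 * PICO)
print(f"{upper_bound * 1e3:.2f} mM")  # prints "1.00 mM"
```

Lowering `cleaved_fraction` scales the result linearly, so a 10% release under the same assumptions would give roughly 100 µM.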
