Building Block-Based Binding Predictions for DNA-Encoded Libraries.

Chris Zhang, Nicolas Tilmans, Joe Franklin, Henri Palacci, Svetlana Belyanskaya, Anjali Dixit, Lashadric Grady, Mary Pitman, David L Mobley, Meghan Lawler, Sumudu Leelananda

doi:10.1021/acs.jcim.3c00588

Open Access

https://doi.org/10.1021/acs.jcim.3c00588

Abstract

DNA-encoded libraries (DELs) provide the means to make and screen millions of diverse compounds against a target of interest in a single experiment. However, despite producing large volumes of binding data at a relatively low cost, the DEL selection process is susceptible to noise, necessitating computational follow-up to increase signal-to-noise ratios. In this work, we present a set of informatics tools that use data from prior DEL screens to identify which building blocks are most likely to be productive when designing new DELs for the same target. We demonstrate that similar building blocks have similar probabilities of forming compounds that bind. We then build a model on the inference that the combined behavior of individual building blocks is predictive of whether an overall compound binds. We illustrate our approach on a set of three-cycle OpenDEL libraries screened against soluble epoxide hydrolase (sEH) and report performance more than an order of magnitude better than random guessing on a holdout set, demonstrating that our model can serve as a baseline for comparison against other machine learning models on DEL data. Lastly, we discuss how we believe this informatics workflow could benefit researchers in their own DEL campaigns.
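
The idea that the combined behavior of individual building blocks predicts whether a compound binds can be illustrated with a small sketch. This is not the authors' implementation: the function names, the toy building-block labels, and the choice to combine per-position binder fractions by a simple product are all assumptions made here for illustration only.

```python
from collections import defaultdict


def building_block_binder_fractions(compounds, labels, n_cycles=3):
    """For each cycle position, map each building block to the fraction of
    training compounds containing it that were labeled as binders.

    Hypothetical helper for illustration; not taken from the paper.
    """
    counts = [defaultdict(lambda: [0, 0]) for _ in range(n_cycles)]
    for blocks, is_binder in zip(compounds, labels):
        for pos, bb in enumerate(blocks):
            counts[pos][bb][0] += int(is_binder)  # binder count
            counts[pos][bb][1] += 1               # total count
    return [
        {bb: hits / total for bb, (hits, total) in per_pos.items()}
        for per_pos in counts
    ]


def score_compound(blocks, fractions, default=0.0):
    """Combine per-position binder fractions into one compound score.

    The product used here is an assumed, naive combination rule; building
    blocks never seen in training fall back to `default`.
    """
    score = 1.0
    for pos, bb in enumerate(blocks):
        score *= fractions[pos].get(bb, default)
    return score


# Toy three-cycle data (invented labels, not real screening results).
train_compounds = [
    ("A1", "B1", "C1"),
    ("A1", "B1", "C2"),
    ("A1", "B2", "C1"),
    ("A2", "B1", "C1"),
    ("A2", "B2", "C2"),
]
train_labels = [True, True, False, False, False]

fractions = building_block_binder_fractions(train_compounds, train_labels)

# Rank held-out compounds by score; higher means more likely to bind
# under this simplified model.
candidates = [("A1", "B1", "C2"), ("A2", "B2", "C1"), ("A1", "B2", "C2")]
ranked = sorted(candidates, key=lambda c: score_compound(c, fractions), reverse=True)
```

In this toy setting, a compound assembled from building blocks that frequently appear in binders ranks above one assembled from blocks that never do. A practical version would also need to handle noisy labels and building blocks absent from the training screen, which this sketch reduces to a fixed default.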