Abstract

Drug discovery is widely recognized to be a difficult and costly activity in large part due to the challenge of identifying chemical matter which simultaneously optimizes multiple properties, one of which is affinity for the primary biological target. Further, many of these properties are difficult to predict ahead of expensive and time-consuming compound synthesis and experimental testing. Here we highlight recent work to develop compound affinity prediction models, and extensively investigate the value such models may provide to preclinical drug discovery. We demonstrate that the ability of these models to improve the overall probability of success is crucially dependent on the shape of the error distribution, not just the root-mean-square error. In particular, while scoring more molecule ideas generally improves the probability of project success when the error distribution is Gaussian, fat-tail distributions such as a Cauchy distribution, can lead to a situation where scoring more ideas actually decreases the overall probability of success.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call