Abstract

We have recently developed a tool, MoKa, to predict the p K a of organic compounds using a large dataset of over 26,500 literature p K a values as a training set. However, predicting accurately p K a (<0.5 pH units) remains challenging for novel series, and this can be a drawback in the optimization of activity and ADME properties of lead compounds. To address this issue it is important to expand our knowledge of p K a determinants, therefore we have conducted high-throughput p K a measurements by using Spectral Gradient Analysis (SGA) on novel series of compounds selected from vendor databases. Here we report our findings on the effect of specific chemical groups and steric constraints on the p K a of common functionalities in medicinal chemistry, such as amines, sulfonamides, and amides. Furthermore, we report the p K a of ionizable groups that were not well represented in the database of literature p K a of MoKα, such as hydrazide derivatives. These findings helped us to enhance MoKα, which is here benchmarked on a set of experimental p K a values from the Roche in-house library ( N = 5581; RMSE = 1.09; R2 = 0.82). The accuracy of the predictions was greatly improved (RMSE = 0.49, R2 = 0.96) after training the software by using the automated tool Kibitzer with 6226 p K a values taken from a different set of Roche compounds appropriately selected, and this demonstrates the value of using high-throughput p K a measurements to expand the training set of p K a values used by the software MoKα.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call