While predicting a ligand that binds to a protein is feasible with current methods, the opposite, i.e., the prediction of a receptor for a ligand remains challenging. We present an approach for predicting receptors of a given ligand that uses de novo design and structural bioinformatics. We have developed the algorithm CRD, comprising multiple modules combining fragment-based sub-site finding, a machine learning function to estimate the size of the site, a genetic algorithm that encodes knowledge on protein structures and a physics-based fitness scoring scheme. CRD includes a pseudo-receptor design component followed by a mapping component to identify proteins that might contain these sites. CRD recovers the sites and receptors of several natural ligands. It designs similar sites for similar ligands, yet to some extent can distinguish between closely related ligands. CRD correctly predicts receptor classes for several drugs and might become a valuable tool for drug discovery.
Read full abstract