Abstract

Virtual screening based on protein-ligand docking is widely applied at the early stage of drug discovery. Scoring functions from a diverse set of existing protein-ligand docking tools, however, often poorly distinguish bioactive compounds from inactive ones. As a result, considerable effort has been devoted to the combination of multiple scoring functions for more reliable evaluation. State-of-the-art consensus scoring or ensemble learning methods assume each scoring function performs uniformly for all cases. Case-based meta-learning (CBML), the method we have developed, is fundamentally different. It identifies the best predictor for a specific new case based on its similarity to old cases and uses that method to predict rather than average the performance of all predictors. Our large-scale benchmark studies clearly indicate that CBML outperforms consensus-based scoring and significantly improves the performance of structure-based virtual screening. The CBML paradigm can be extended to other applications in bioinformatics and chemoinformatics for robust and reliable predictive modeling.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.