Abstract

A method called "Emerging Chemical Patterns" (ECP) has recently been introduced as a novel approach to binary molecular classification (for example, "active" versus "inactive"). The underlying pattern recognition algorithm was first introduced in computer science and then adopted for applications in medicinal chemistry and compound screening. A special feature is its ability to accurately classify molecules on the basis of very small training sets containing only a few compounds. This feature is highly relevant for virtual compound screening when only very few experimental hits are available as templates. Here we adopt ECP calculations to simulate sequential screening using an experimental high-throughput screening (HTS) data set containing inhibitors of dihydrofolate reductase. In doing so, we focus on minimizing the number of database compounds that need to be evaluated in order to identify a substantial fraction of available hits. We demonstrate that iterative ECP calculations recover on average between approximately 19% and approximately 39% of available hits in the data set while dramatically reducing the number of compounds that need to be tested to between approximately 0.002% and approximately 9% of the screening database.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.