Abstract

Aim:Generation of an R-group replacement system for compound optimization in medicinal chemistry.Materials & methods:From bioactive compounds, analogue series (ASs) were systematically extracted and from these ASs, all R-groups were isolated and further analyzed.Exemplary results & data:From more than 17,000 ASs, more than 50,000 unique R-groups were isolated. For the 500 most frequently used R-groups, preferred replacements were identified and organized in hierarchies. All original data and an R-group replacement database are made available in an open access deposition.Limitations & next steps:The searchable database has no limitations and can easily be modified using the source data we provide. The next step will be applying this R-group resource in practical medicinal chemistry projects as decision support.

Highlights

  • We have further extended the assessment of R-groups on the basis of analogue series (ASs) [8] by generalizing the approach and identified all R-groups that currently occur in ASs

  • To generalize the approach and identify all possible R-groups, beyond those defined by retrosynthetic rules, we introduced an algorithmic compound–core relationship (CCR) variant in which retrosynthetic fragmentation was replaced by random fragmentation of single exocyclic bonds in compounds

  • We analyzed the frequency with which individual R-groups occurred across all replacements in the global network and found that the top three R-groups in Figure 2 including the H atom, methyl group and phenyl ring were for the most part involved in replacements of other frequently used R-groups

Read more

Summary

Final network

We analyzed the frequency with which individual R-groups occurred across all replacements in the global network and found that the top three R-groups in Figure 2 including the H atom, methyl group and phenyl ring were for the most part involved in replacements of other frequently used R-groups. Compared with the global or final network, this small local network has a dramatically reduced edge density (and is used for representation purposes only) It provides a view of replacements among most frequent R-groups. Data We provide a searchable repository containing the 500 most frequently used R-groups from the final network and their 5 × 2 first and second layer replacement hierarchies. For an R-group of interest, preferred replacements across bioactive compounds can be

Second layer
Summary points

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.