Active pharmaceutical ingredients (APIs) have gained direct pharmaceutical interest, along with their in vitro properties, and thus utilized as auxiliary solid dosage forms upon FDA guidance and approval on pharmaceutical cocrystals when reacting with coformers, as a potential and attractive route for drug substance development. However, screening and selecting suitable and appropriate coformers that may potentially react with APIs to successfully form cocrystals is a time-consuming, inefficient, economically expensive, and labour-intensive task. In this study, we implemented GNNs to predict the formation of cocrystals using our introduced API-coformers relational graph data. We further compared our work with previous studies that implemented descriptor-based models (e.g., random forest, support vector machine, extreme gradient boosting, and artificial neural networks). All built graph-based models show compelling performance accuracies (i.e., 91.36, 94.60 and 95. 95% for GCN, GraphSAGE, and RGCN respectively). RGCN demonstrated effectiveness and prevailed among the built graph-based models due to its capability to capture intricate and learn nuanced relationships between entities such as non-ionic and non-covalent interactions or link information between APIs and coformers which are crucial for accurate predictions and representations. These capabilities allows the model to adeptly learn the topological structure inherent in the graph data.
Read full abstract