Triple-negative breast cancer (TNBC) lacks estrogen, progesterone, and HER2 expression, accounting for 15-20% of breast cancer cases. It is challenging due to low therapeutic response, heterogeneity, and aggressiveness. The PI3Ka isoform is a promising therapeutic target, often hyperactivated in TNBC, contributing to uncontrolled growth and cancer cell formation. We have proposed an interpretable deep convolutional neural network prediction (iDCNNPred) system using 2D molecular images to classify bioactivity and identify potential PI3Ka inhibitors. We built Custom-DCNN models and pre-trained models such as AlexNet, SqueezeNet, and VGG19 by using the Bayesian optimization algorithm, and found that our Custom-DCNN model performed better than a pre-trained model with lower complexity and memory usage. All top-performed models were screened with the Maybridge Chemical library to find predictive hit molecules. The screened molecules were further evaluated for protein-ligand interaction with molecular docking and finally 12 promising hits were shortlisted for biological validation using in-vitro PI3K inhibition studies. After biological evaluation, 4 potent molecules with different structural moieties were identified, and these molecules present new starting scaffolds for further improvement in terms of their potency and selectivity as PI3K inhibitors with the help of medicinal chemistry efforts. Furthermore, we also showed the significance of the interpretation and visualization of the model's predictions by the Grad-CAM technique, enhancing the robustness, transparency, and interpretability of the model's predictions. The data and script files and prediction run of models used for this study to reproduce the experiment are available in the GitHub repository at https://github.com/ravishankar1307/iDCNNPred.git .
Read full abstract