Abstract
<span lang="EN-US">Recently, advancements in computational and artificial intelligence (AI) methods have contributed in improving research results in the field of drug discovery. In fact, machine learning techniques have proven to be especially effective in this regard, aiding in the development of new drug variants and enabling more precise targeting of specific disease mechanisms. In this paper, we propose to use a quantitative structure-activity relationship-based approach for predicting active compounds related to non-small cell lung cancer. Our approach uses a neural network classifier that learns from sequential structures and chemical properties of molecules, as well as a gradient boosting tree classifier to conduct comparative analysis. To evaluate the contribution of each feature, we employ Shapley additive explanations (SHAP) summary plots to perform features selection. Our approach involves a dataset of active and non-active molecules collected from ChEMBL database. Our results show the effectiveness of the proposed approach when it comes to predicting accurately active compounds for lung cancer. Furthermore, our comparative analysis reveals important chemical structures that contribute to the effectiveness of the compounds. Thus, the proposed approach can greatly enhance the drug discovery pipeline and may lead to the development of new and effective treatments for lung cancer.</span>
Published Version (
Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have