Molecular Graph Research Articles

Protein-ligand binding affinity plays a pivotal role in drug development, particularly in identifying potential ligands for target disease-related proteins. Accurate affinity predictions can significantly reduce both the time and cost involved in drug development. However, highly precise affinity prediction remains a research challenge. A key to improving affinity prediction is capturing interactions between proteins and ligands effectively. Existing deep-learning-based computational approaches use 3D grids, 4D tensors, molecular graphs, or proximity-based adjacency matrices, which are either resource-intensive or do not directly represent potential interactions. In this paper, we propose atomic-level distance features and attention mechanisms to better capture specific protein-ligand interactions based on donor-acceptor relations, hydrophobicity, and π-stacking atoms. We argue that distances encompass both short-range direct and long-range indirect interaction effects, while attention mechanisms capture levels of interaction effects. On the widely used CASF-2016 dataset, our proposed method, named Distance plus Attention for Affinity Prediction (DAAP), significantly outperforms existing methods by achieving Correlation Coefficient (R) 0.909, Root Mean Squared Error (RMSE) 0.987, Mean Absolute Error (MAE) 0.745, Standard Deviation (SD) 0.988, and Concordance Index (CI) 0.876. The proposed method also shows substantial improvement, around 2% to 37%, on five other benchmark datasets. The program and data are publicly available on the website https://gitlab.com/mahnewton/daap.

Scientific Contribution Statement: This study introduces distance-based features to predict protein-ligand binding affinity, capitalizing on unique molecular interactions. Furthermore, the incorporation of protein sequence features of specific residues enhances the model's proficiency in capturing intricate binding patterns. The predictive capabilities are further strengthened through the use of a deep learning architecture with attention mechanisms, and an ensemble approach, averaging the outputs of five models, is implemented to ensure robust and reliable predictions.
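The evaluation metrics named in the abstract (R, RMSE, MAE, SD, CI) and the five-model averaging can be illustrated with a minimal pure-Python sketch. It is not taken from the DAAP repository: all function names are hypothetical, SD is assumed to be the residual standard deviation after a linear fit of the experimental values on the predictions (a definition commonly used with CASF benchmarks), and CI is assumed to be the fraction of correctly ordered pairs with prediction ties counted as one half.

```python
import math

def ensemble_mean(predictions_per_model):
    """Average per-sample predictions across models (hypothetical helper)."""
    return [sum(col) / len(col) for col in zip(*predictions_per_model)]

def _moments(y_true, y_pred):
    # Means, covariance sum, and sums of squared deviations.
    n = len(y_true)
    mt, mp = sum(y_true) / n, sum(y_pred) / n
    cov = sum((t - mt) * (p - mp) for t, p in zip(y_true, y_pred))
    vt = sum((t - mt) ** 2 for t in y_true)
    vp = sum((p - mp) ** 2 for p in y_pred)
    return mt, mp, cov, vt, vp

def pearson_r(y_true, y_pred):
    _, _, cov, vt, vp = _moments(y_true, y_pred)
    return cov / math.sqrt(vt * vp)

def rmse(y_true, y_pred):
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def mae(y_true, y_pred):
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def sd_regression(y_true, y_pred):
    # Assumed definition: fit y_true = a * y_pred + b, then take the
    # standard deviation of the residuals with N - 1 in the denominator.
    mt, mp, cov, _, vp = _moments(y_true, y_pred)
    a = cov / vp
    b = mt - a * mp
    rss = sum((t - (a * p + b)) ** 2 for t, p in zip(y_true, y_pred))
    return math.sqrt(rss / (len(y_true) - 1))

def concordance_index(y_true, y_pred):
    # Fraction of pairs with different true values whose predicted order
    # matches the true order; tied predictions count as 0.5.
    concordant, comparable = 0.0, 0
    n = len(y_true)
    for i in range(n):
        for j in range(i + 1, n):
            if y_true[i] == y_true[j]:
                continue
            comparable += 1
            dt, dp = y_true[i] - y_true[j], y_pred[i] - y_pred[j]
            if dp == 0:
                concordant += 0.5
            elif (dt > 0) == (dp > 0):
                concordant += 1.0
    return concordant / comparable
```

In use, the per-model predictions would first be combined with `ensemble_mean`, and the averaged values would then be scored against the experimental affinities with the remaining functions.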

The proteins within the human epidermal growth factor receptor (EGFR) family, members of the tyrosine kinase receptor family, play a pivotal role in the molecular mechanisms driving the development of various tumors. Tyrosine kinase inhibitors, key compounds in targeted therapy, encounter challenges in cancer treatment due to emerging drug resistance mutations. Consequently, machine learning has undergone significant evolution to address the challenges of cancer drug discovery related to EGFR family proteins. However, the application of deep learning in this area is hindered by inherent difficulties associated with small-scale data, particularly the risk of overfitting. Moreover, the design of a model architecture that facilitates learning through multi-task and transfer learning, coupled with appropriate molecular representation, poses substantial challenges. In this study, we introduce GraphEGFR, a deep learning regression model designed to enhance molecular representation and model architecture for predicting the bioactivity of inhibitors against both wild-type and mutant EGFR family proteins. GraphEGFR integrates a graph attention mechanism for molecular graphs with deep and convolutional neural networks for molecular fingerprints. We observed that GraphEGFR models employing multi-task and transfer learning strategies generally achieve predictive performance comparable to existing competitive methods. The integration of molecular graphs and fingerprints adeptly captures relationships between atoms and enables both global and local pattern recognition. We further validated potential multi-targeted inhibitors for wild-type and mutant HER1 kinases, exploring key amino acid residues through molecular dynamics simulations to understand molecular interactions. 
This predictive model offers a robust strategy that could significantly contribute to overcoming the challenges of developing deep learning models for drug discovery with limited data and exploring new frontiers in multi-targeted kinase drug discovery for EGFR family proteins.
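As a rough illustration of what a graph attention step over a molecular graph does, the sketch below updates each atom's feature vector as a softmax-weighted sum over itself and its bonded neighbours. It is a simplification under stated assumptions and not the GraphEGFR implementation: the attention score is a plain dot product rather than a learned function, there are no trainable weights, the fingerprint branch is omitted, and the names `softmax` and `attention_aggregate` are hypothetical.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_aggregate(features, neighbors):
    """One simplified attention step over a molecular graph.

    features:  list of per-atom feature vectors (lists of floats)
    neighbors: dict mapping atom index -> list of bonded atom indices
    Assumption: scores are dot products between atom features; a real
    graph attention layer would learn projections and a scoring vector.
    """
    updated = []
    for i, h_i in enumerate(features):
        idx = [i] + list(neighbors.get(i, []))  # include a self-loop
        scores = [sum(a * b for a, b in zip(h_i, features[j])) for j in idx]
        weights = softmax(scores)
        updated.append([
            sum(w * features[j][d] for w, j in zip(weights, idx))
            for d in range(len(h_i))
        ])
    return updated

# Toy three-atom chain 0-1-2 with two-dimensional atom features.
toy_features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
toy_neighbors = {0: [1], 1: [0, 2], 2: [1]}
toy_updated = attention_aggregate(toy_features, toy_neighbors)
```

Each updated vector is a convex combination of the original vectors in the atom's neighbourhood, so atoms whose features score higher against the centre atom contribute more to its new representation.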

Articles published on Molecular Graph

Hierarchical multimodal self-attention-based graph neural network for DTI prediction.

MvMRL: a multi-view molecular representation learning method for molecular property prediction.

QSPR analysis for physiochemical properties of new potential antimalarial compounds involving topological indices

MASMDDI: multi-layer adaptive soft-mask graph neural network for drug-drug interaction prediction.

HMMF: a hybrid multi-modal fusion framework for predicting drug side effect frequencies

Computation of molecular description of supramolecular Fuchsine model useful in medical data

Distance plus attention for binding affinity prediction

PointGAT: A Quantum Chemical Property Prediction Model Integrating Graph Attention and 3D Geometry.

Predicting drug-Protein interaction with deep learning framework for molecular graphs and sequences: Potential candidates against SAR-CoV-2.

GraphEGFR: Multi-task and transfer learning based on molecular graph attention mechanism and fingerprints improving inhibitor bioactivity prediction for EGFR family proteins on data scarcity.

Rethinking the Masking Strategy for Pretraining Molecular Graphs from a Data-Centric View.

Molecular property prediction based on graph structure learning.

Exploring expected values of topological indices of random cyclodecane chains for chemical insights

Structure-property modeling of coumarins and coumarin-related compounds in pharmacotherapy of cancer by employing graphical topological indices.

The Second Omega Index

Predicting equilibrium distributions for molecular systems with deep learning

Forgotten Topological and Wiener Indices of Prime Ideal Sum Graph of Zn.

Zagreb Topological Properties of Hexa Organic Molecular Structures.

Weighted Mostar invariants of chemical compounds: An analysis of structural stability

Degree Descriptors and Graph Entropy Quantities of Zeolite ACO.
