Graphs are one of the most natural and powerful representations available for molecules: natural because they correspond intuitively to skeletal formulas, the language used by chemists worldwide, and powerful because they are highly expressive both globally (molecular topology) and locally (atom and bond properties). Graph kernels transform molecular graphs into fixed-length vectors which, owing to their capacity to measure similarity, can be used as fingerprints for machine learning (ML). To date, graph kernels have mostly focused on the atomic nodes of the graph. In this work, we developed a graph kernel based on atom-atom, bond-bond, and bond-atom (AABBA) autocorrelations. The resulting vector representations were tested on regression ML tasks on a data set of transition metal complexes, a benchmark motivated by the higher complexity of these compounds relative to organic molecules. In particular, we tested different flavors of the AABBA kernel in the prediction of the energy barriers and bond distances of the Vaska's complex data set (Friederich et al., Chem. Sci., 2020, 11, 4584). For a variety of ML models, including neural networks, gradient boosting machines, and Gaussian processes, we showed that AABBA outperforms a baseline including only atom-atom autocorrelations. Dimensionality reduction studies also showed that the bond-bond and bond-atom autocorrelations yield many of the most relevant features. We believe that the AABBA graph kernel can accelerate the exploration of large chemical spaces and inspire novel molecular representations in which both atomic and bond properties play an important role.
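As a rough illustration of what such descriptors compute, the sketch below assumes the standard Moreau-Broto autocorrelation form, AC(P, d) = Σ_i Σ_j P_i P_j summed over atom pairs (i, j) at topological (shortest-path) distance d; the networkx-based code and the choice of atomic number as the atom property are illustrative assumptions, not the paper's implementation.

```python
# Minimal sketch of an atom-atom autocorrelation on a molecular graph,
# assuming the Moreau-Broto form AC(P, d) = sum over node pairs (i, j)
# at shortest-path distance d of P_i * P_j. Hypothetical example, not
# the AABBA paper's code.
import networkx as nx

def atom_atom_autocorrelation(graph, prop, max_depth=3):
    """Return [AC(prop, 0), ..., AC(prop, max_depth)] for a node property."""
    # Shortest-path distances between all node pairs, truncated at max_depth.
    dist = dict(nx.all_pairs_shortest_path_length(graph, cutoff=max_depth))
    ac = [0.0] * (max_depth + 1)
    for i, reachable in dist.items():
        for j, d in reachable.items():
            ac[d] += graph.nodes[i][prop] * graph.nodes[j][prop]
    return ac

# Toy example: water, with atomic number Z as the atom property.
g = nx.Graph()
g.add_node(0, Z=8)  # O
g.add_node(1, Z=1)  # H
g.add_node(2, Z=1)  # H
g.add_edges_from([(0, 1), (0, 2)])
print(atom_atom_autocorrelation(g, "Z", max_depth=2))
# d=0: 8*8 + 1*1 + 1*1 = 66; d=1: 4*(8*1) = 32; d=2: 2*(1*1) = 2
```

Under the same assumptions, bond-bond autocorrelations would apply the identical recipe to the line graph of the molecule (bonds become nodes carrying bond properties such as bond order), and bond-atom terms would pair a bond property with an atom property at a given distance.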