ChemSAR: an online pipelining platform for molecular SAR modeling

Jie Dong, Min-Feng Zhu, Ai-Ping Lu, Zhi-Jiang Yao, Ben Lu, Alex F Chen, Hongyu Miao, Wen-Bin Zeng, Dong-Sheng Cao, Ning-Ning Wang

doi:10.1186/s13321-017-0215-1

Abstract

Background: In recent years, predictive models based on machine learning techniques have proven to be feasible and effective in drug discovery. However, to develop such a model, researchers usually have to combine multiple tools and work through several separate steps (e.g., the RDKit or ChemoPy package for molecular descriptor calculation, ChemAxon Standardizer for structure preprocessing, the scikit-learn package for model building, and the ggplot2 package for statistical analysis and visualization). In addition, these tasks may require strong programming skills, which poses severe challenges for users without advanced training in computer programming. Therefore, an online pipelining platform that integrates a number of selected tools is a valuable and efficient solution that can meet the needs of related researchers.

Results: This work presents a web-based pipelining platform, called ChemSAR, for generating SAR classification models of small molecules. The capabilities of ChemSAR include the validation and standardization of chemical structure representation, the computation of 783 1D/2D molecular descriptors and ten types of widely used fingerprints for small molecules, filter methods for feature selection, the generation of predictive models via a step-by-step job submission process, model interpretation in terms of feature importance and tree visualization, and a report generation system. The results can be visualized as high-quality plots and downloaded as local files.

Conclusion: ChemSAR provides an integrated web-based platform for generating SAR classification models that will benefit cheminformatics and other biomedical users. It is freely available at: http://chemsar.scbdd.com.
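As an illustration of what a filter method for feature selection can look like, here is a minimal sketch in plain Python. It is not ChemSAR's implementation: the abstract does not say which filters or cutoffs the platform uses, so the two filters shown (dropping near-constant descriptors, then dropping one descriptor of each highly correlated pair), the function names, the thresholds, and the descriptor values are all assumptions made for this example.

```python
from math import sqrt
from statistics import mean, pvariance


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def filter_descriptors(table, min_variance=1e-8, max_abs_corr=0.95):
    """Return the names of descriptors that survive two simple filters.

    table maps a descriptor name to its values, one value per molecule.
    Both thresholds are illustrative defaults, not values taken from ChemSAR.
    """
    kept = []
    for name, values in table.items():
        # Filter 1: a near-constant descriptor carries no information.
        if pvariance(values) <= min_variance:
            continue
        # Filter 2: skip a descriptor that is almost collinear with one already kept.
        if any(abs(pearson(values, table[k])) > max_abs_corr for k in kept):
            continue
        kept.append(name)
    return kept


# Hypothetical descriptor table for five molecules (made-up values).
descriptors = {
    "mol_weight": [180.0, 151.0, 206.0, 194.0, 46.0],
    "mol_weight_scaled": [1.80, 1.51, 2.06, 1.94, 0.46],  # redundant with mol_weight
    "n_charges": [0, 0, 0, 0, 0],                          # constant
    "logp": [1.2, 0.5, 3.5, -0.1, -0.3],
}

selected = filter_descriptors(descriptors)
print(selected)
```

With this toy table the scaled copy of the molecular weight and the constant charge count are discarded, while the two remaining descriptors pass on to model building. Filters of this kind are cheap because they look only at the descriptor matrix and never train a model.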

Highlights

In recent years, predictive models based on machine learning techniques have proven to be feasible and effective in drug discovery.
In the drug discovery field, machine learning methods are frequently applied to build in silico predictive models in studies of structure–activity relationships (SAR) and structure–property relationships (SPR) to assess or predict various drug activities [8, 9] and ADME/T properties [10,11,12,13,14,15,16].
The pharmaceutical industry's most important strategy for overcoming its productivity crisis in drug discovery is to focus on the molecular properties of absorption, distribution, metabolism and excretion (ADME).

Summary

Introduction

Predictive models based on machine learning techniques have proven to be feasible and effective in drug discovery. To develop such a model, researchers usually have to combine multiple tools and work through several separate steps (e.g., the RDKit or ChemoPy package for molecular descriptor calculation, ChemAxon Standardizer for structure preprocessing, the scikit-learn package for model building, and the ggplot2 package for statistical analysis and visualization). In the drug discovery field, machine learning methods are frequently applied to build in silico predictive models in studies of structure–activity relationships (SAR) and structure–property relationships (SPR) to assess or predict various drug activities [8, 9] and ADME/T properties [10,11,12,13,14,15,16].

Methods

Results

Conclusion


Journal: Journal of Cheminformatics
Publication Date: May 4, 2017
Citations: 49
License type: open-access


Similar Papers

ChemDes: an integrated web-based platform for molecular descriptor and fingerprint computation.
Jie Dong ... Yong-Huan Yun
Journal of Cheminformatics | VOL. 7
01 Dec 2015

Machine learning in computational docking.
Mohamed A Khamis ... Walid Gomaa
Artificial Intelligence in Medicine | VOL. 63
16 Feb 2015

NASPGHAN Guidelines for Training in Pediatric Gastroenterology
Alan M Leichtner ... Paul A Rufo
Journal of Pediatric Gastroenterology and Nutrition | VOL. 56
01 Jan 2013

A machine-learning-assisted study of the permeability of small drug-like molecules across lipid membranes.
Guang Chen ... Ying Li
Physical chemistry chemical physics : PCCP | VOL. 22
01 Jan 2020
