AMPL: A Data-Driven Modeling Pipeline for Drug Discovery.

Amanda J Minnich,Jason Deng,Benjamin D Madej,Andrew Weber,Bharath Ramsundar,Jim Brase,Neha Murad,Stacie Calad-Thomson,Kevin Mcloughlin,Margaret Tse,Tom Rush,Jonathan E Allen

doi:10.1021/acs.jcim.9b01053

Abstract

One of the key requirements for incorporating machine learning (ML) into the drug discovery process is complete traceability and reproducibility of the model building and evaluation process. With this in mind, we have developed an end-to-end modular and extensible software pipeline for building and sharing ML models that predict key pharma-relevant parameters. The ATOM Modeling PipeLine, or AMPL, extends the functionality of the open source library DeepChem and supports an array of ML and molecular featurization tools. We have benchmarked AMPL on a large collection of pharmaceutical data sets covering a wide range of parameters. Our key findings indicate that traditional molecular fingerprints underperform other feature representation methods. We also find that data set size correlates directly with prediction performance, which points to the need to expand public data sets. Uncertainty quantification can help predict model error, but correlation with error varies considerably between data sets and model types. Our findings point to the need for an extensible pipeline that can be shared to make model building more widely accessible and reproducible. This software is open source and available at: https://github.com/ATOMconsortium/AMPL.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Chemical Information and Modeling	Publication Date: Apr 3, 2020
Citations: 65	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

AMPL: A Data-Driven Modeling Pipeline for Drug Discovery.

Abstract

Talk to us

Similar Papers

More From: Journal of Chemical Information and Modeling

Lead the way for us

Similar Papers

Uncertainty quantification in machine learning for engineering design and health prognostics: A tutorial
Venkat Nemani ... Chao Hu
Mechanical Systems and Signal Processing | VOL. 205
Venkat Nemani, et. al.Venkat Nemani ... Chao Hu
19 Oct 2023
Mechanical Systems and Signal Processing | VOL. 205

Machine learning to predict post-operative acute kidney injury stage 3 after heart transplantation
Tingyu Li ... Rui Chen
BMC Cardiovascular Disorders | VOL. 22
Tingyu Li, et. al.Tingyu Li ... Rui Chen
25 Jun 2022
BMC Cardiovascular Disorders | VOL. 22

Dynamic Autoselection and Autotuning of Machine Learning Models for Cloud Network Analytics
Rupesh Raj Karn ... Prabhakar Kudva
IEEE Transactions on Parallel and Distributed Systems | VOL. 30
Rupesh Raj Karn, et. al.Rupesh Raj Karn ... Prabhakar Kudva
01 May 2019
IEEE Transactions on Parallel and Distributed Systems | VOL. 30

Facilitating Machine Learning Model Comparison and Explanation through a Radial Visualisation
Jianlong Zhou ... Fang Chen
Energies | VOL. 14
Jianlong Zhou, et. al.Jianlong Zhou ... Fang Chen
28 Oct 2021
Energies | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AMPL: A Data-Driven Modeling Pipeline for Drug Discovery.

Abstract

Talk to us

Similar Papers

More From: Journal of Chemical Information and Modeling