PLAS-5k: Dataset of Protein-Ligand Affinities from Molecular Dynamics for Machine Learning Applications

Divya B Korlepara, Charuvaka Muvva, Vishal Kumar, C S Vasavi, U Deva Priyakumar, Bhuvanesh Sridharan, Divya Nayar, Sarvesh Mehta, Akshit Garg, Rohit Modee, Pradeep Kumar Pal, Agastya P Bhati, Shruti Jeurkar, Shubham Sharma, Subhajit Roy

doi:10.1038/s41597-022-01631-9

Abstract

Computational methods, and more recently machine learning methods, have played a key role in structure-based drug design. Although several benchmarking datasets are available for machine learning applications in virtual screening, accurate prediction of the binding affinity of a protein-ligand complex remains a major challenge. New datasets are needed that support the development of models which predict binding affinities better than state-of-the-art scoring functions. For the first time, we have developed such a dataset, PLAS-5k, comprising 5000 protein-ligand complexes chosen from the Protein Data Bank (PDB). The dataset consists of binding affinities along with their energy components (electrostatic, van der Waals, polar solvation and non-polar solvation energies), calculated from molecular dynamics simulations using the MMPBSA (Molecular Mechanics Poisson-Boltzmann Surface Area) method. The calculated binding affinities outperformed docking scores and showed a good correlation with the available experimental values. The availability of energy components may enable optimization of desired components during machine learning-based drug design. Further, the OnionNet model has been retrained on the PLAS-5k dataset and is provided as a baseline for the prediction of binding affinities.
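The relationship between the energy components and the binding affinity described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes the common MMPBSA convention in which the binding energy is the sum of the four listed components, with any entropy term omitted. The class name, field names, and the `pearson_r` helper are hypothetical and do not reflect the actual PLAS-5k file layout.

```python
from dataclasses import dataclass
from math import sqrt
from typing import Sequence


@dataclass
class MMPBSAComponents:
    """Hypothetical container for one complex's energy terms (kcal/mol)."""
    electrostatic: float
    van_der_waals: float
    polar_solvation: float
    nonpolar_solvation: float

    def binding_energy(self) -> float:
        # Assumption: binding energy is the plain sum of the four
        # components; no entropy correction is applied in this sketch.
        return (
            self.electrostatic
            + self.van_der_waals
            + self.polar_solvation
            + self.nonpolar_solvation
        )


def pearson_r(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation, e.g. between calculated and experimental values."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    # Illustrative numbers only; these are not values from PLAS-5k.
    example = MMPBSAComponents(-20.0, -45.0, 40.0, -5.0)
    print(example.binding_energy())
```

Keeping the components as separate fields, rather than storing only the total, mirrors the point made in the abstract that individual energy terms may be useful targets during machine learning-based design.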

Journal: Scientific Data
Publication Date: Sep 7, 2022
Citations: 7
License type: open-access

Similar Papers

Ligand binding affinity prediction with fusion of graph neural networks and 3D structure-based complex graph.
Lina Dong ... Binju Wang
Physical chemistry chemical physics : PCCP | VOL. 25
01 Jan 2023

Evaluating the performance of MM/PBSA for binding affinity prediction using class A GPCR crystal structures.
Mei Qian Yau ... Jason S E Loo
Journal of Computer-Aided Molecular Design | VOL. 33
15 Apr 2019

Computational repurposing approach for targeting the critical spike mutations in B.1.617.2 (delta), AY.1 (delta plus) and C.37 (lambda) SARS-CoV-2 variants using exhaustive structure-based virtual screening, molecular dynamic simulations and MM-PBSA methods
Maryam Ebrahimi ... Mahdi Alijanianzadeh
Computers in Biology and Medicine | VOL. 147
07 Jun 2022

Validation of an automated procedure for the prediction of relative free energies of binding on a set of aldose reductase inhibitors
Anna Maria Ferrari ... Giulio Rastelli
Bioorganic & Medicinal Chemistry | VOL. 15
22 Aug 2007
