Identification of small molecule aggregators from large compound libraries by support vector machines

Hanbing Rao,Yuzong Chen,Hu Li,Xiangyuan Li,Choongyong Ung,Xianghui Liu,Zerong Li,Xiaohua Ma

doi:10.1002/jcc.21347

Abstract

Small molecule aggregators non-specifically inhibit multiple unrelated proteins, rendering them therapeutically useless. They frequently appear as false hits and thus need to be eliminated in high-throughput screening campaigns. Computational methods have been explored for identifying aggregators, which have not been tested in screening large compound libraries. We used 1319 aggregators and 128,325 non-aggregators to develop a support vector machines (SVM) aggregator identification model, which was tested by four methods. The first is five fold cross-validation, which showed comparable aggregator and significantly improved non-aggregator identification rates against earlier studies. The second is the independent test of 17 aggregators discovered independently from the training aggregators, 71% of which were correctly identified. The third is retrospective screening of 13M PUBCHEM and 168K MDDR compounds, which predicted 97.9% and 98.7% of the PUBCHEM and MDDR compounds as non-aggregators. The fourth is retrospective screening of 5527 MDDR compounds similar to the known aggregators, 1.14% of which were predicted as aggregators. SVM showed slightly better overall performance against two other machine learning methods based on five fold cross-validation studies of the same settings. Molecular features of aggregation, extracted by a feature selection method, are consistent with published profiles. SVM showed substantial capability in identifying aggregators from large libraries at low false-hit rates.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Identification of small molecule aggregators from large compound libraries by support vector machines

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Chemistry

Lead the way for us

Journal: Journal of Computational Chemistry	Publication Date: Jun 30, 2009
Citations: 28

Similar Papers

A support vector machines approach for virtual screening of active compounds of single and multiple mechanisms from large libraries at an improved hit-rate and enrichment factor
L.Y Han ... Y.Z Chen
Journal of Molecular Graphics and Modelling | VOL. 26
L.Y Han, et. al.L.Y Han ... Y.Z Chen
15 Dec 2007
Journal of Molecular Graphics and Modelling | VOL. 26

Development of Ligand-based Big Data Deep Neural Network Models for Virtual Screening of Large Compound Libraries.
Tao Xiao ... Yuyang Jiang
Molecular informatics | VOL. 37
Tao Xiao, et. al.Tao Xiao ... Yuyang Jiang
08 Jun 2018
Molecular informatics | VOL. 37

Development and experimental test of support vector machines virtual screening method for searching Src inhibitors from large compound libraries.
Bucong Han ... Xianghui Liu
Chemistry Central Journal | VOL. 6
Bucong Han, et. al.Bucong Han ... Xianghui Liu
23 Nov 2012
Chemistry Central Journal | VOL. 6

Virtual Screening of Abl Inhibitors from Large Compound Libraries by Support Vector Machines
X H Liu ... B C Low
Journal of Chemical Information and Modeling | VOL. 49
X H Liu, et. al.X H Liu ... B C Low
18 Aug 2009
Journal of Chemical Information and Modeling | VOL. 49

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identification of small molecule aggregators from large compound libraries by support vector machines

Abstract

Talk to us

Similar Papers

More From: Journal of Computational Chemistry