A methodology for evaluating multi-objective evolutionary feature selection for classification in the context of virtual screening

Fernando Jiménez, José Palma, Carlos Martínez, Gracia Sánchez, Horacio Pérez-Sánchez

doi:10.1007/s00500-018-3479-0

Abstract

Virtual screening (VS) methods have been shown to increase success rates in many drug discovery campaigns when they complement experimental approaches such as high-throughput screening or classical medicinal chemistry. Nevertheless, the predictive capability of VS is not yet optimal, mainly due to limitations in the underlying physical principles describing drug binding phenomena. One way to improve VS methods is to combine them with machine learning methods. When enough experimental data are available to train such methods, predictive capability can increase considerably. In this work we show how a multi-objective evolutionary search strategy for feature selection, which yields small and accurate decision trees that chemists can easily understand, can drastically increase the applicability and predictive ability of these techniques and therefore aid considerably in the drug discovery problem. With the proposed methodology, we find classification models with accuracy between 0.9934 and 1.00 and area under the ROC curve between 0.96 and 1.00 when evaluated on the full training sets, and accuracy between 0.9849 and 0.9940 and area under the ROC curve between 0.89 and 0.93 when evaluated with tenfold cross-validation over 30 iterations, while substantially reducing the model size.
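The multi-objective idea in the abstract can be illustrated with a minimal sketch of Pareto filtering over candidate feature subsets. It assumes two objectives, classification accuracy (maximized) and number of selected features (minimized); the abstract does not spell out the objectives, so this pairing is an assumption. The names `Candidate`, `dominates`, and `pareto_front` are hypothetical, and the numbers are made up for illustration rather than taken from the paper.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    """A hypothetical feature subset together with its measured accuracy."""
    features: frozenset  # indices of the selected features
    accuracy: float      # objective to maximize


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is no worse than b on both objectives and strictly better on one.

    Assumed objectives: higher accuracy, fewer selected features.
    """
    no_worse = a.accuracy >= b.accuracy and len(a.features) <= len(b.features)
    better = a.accuracy > b.accuracy or len(a.features) < len(b.features)
    return no_worse and better


def pareto_front(candidates):
    """Keep only the candidates that no other candidate dominates."""
    return [c for c in candidates
            if not any(dominates(other, c) for other in candidates)]


# Made-up values for illustration only; these are not results from the paper.
population = [
    Candidate(frozenset({0, 1, 2, 3}), 0.95),
    Candidate(frozenset({0, 2}), 0.95),
    Candidate(frozenset({1}), 0.90),
    Candidate(frozenset({1, 3, 4}), 0.93),
]

front = pareto_front(population)
for c in front:
    print(sorted(c.features), c.accuracy)
```

An evolutionary search would apply a filter of this kind repeatedly while generating new subsets; the sketch shows only the dominance test, not the search itself or the classifier training that would produce the accuracy values.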

Journal: Soft Computing
Publication Date: Sep 1, 2018
Citations: 9

Similar Papers

Improving drug discovery using a neural networks based parallel scoring function
Horacio Perez-Sanchez ... Jose M Garcia
01 Aug 2013

Improving drug discovery using hybrid softcomputing methods
Horacio Pérez-Sánchez ... José García-Rodríguez
Applied Soft Computing | VOL. 20
28 Nov 2013

Optimization Methods for Virtual Screening on Novel Computational Architectures
Horacio Perez-Sanchez ... Wolfgang Wenzel
Current Computer Aided-Drug Design | VOL. 7
01 Mar 2011

Enhancing the Parallelization of Non-bonded Interactions Kernel for Virtual Screening on GPUs
Baldomero Imbernón ... José L Abellán
01 Jan 2015
