Assessing the behavior of machine learning methods to predict the activity of antimicrobial peptides

Francy Liliana Camacho,Rodrigo Torres-Sáez,Raúl Ramos-Pollán

doi:10.19053/01211129.v26.n44.2017.5834

Francy Liliana Camacho, Rodrigo Torres-Sáez + Show 1 more

Open Access

https://doi.org/10.19053/01211129.v26.n44.2017.5834

Copy DOI

Abstract

This study demonstrates the importance of obtaining statistically stable results when using machine learning methods to predict the activity of antimicrobial peptides, due to the cost and complexity of the chemical processes involved in cases where datasets are particularly small (less than a few hundred instances). Like in other fields with similar problems, this results in large variability in the performance of predictive models, hindering any attempt to transfer them to lab practice. Rather than targeting good peak performance obtained from very particular experimental setups, as reported in related literature, we focused on characterizing the behavior of the machine learning methods, as a preliminary step to obtain reproducible results across experimental setups, and, ultimately, good performance. We propose a methodology that integrates feature learning (autoencoders) and selection methods (genetic algorithms) thorough the exhaustive use of performance metrics (permutation tests and bootstrapping), which provide stronger statistical evidence to support investment decisions with the lab resources at hand. We show evidence for the usefulness of 1) the extensive use of computational resources, and 2) adopting a wider range of metrics than those reported in the literature to assess method performance. This approach allowed us to guide our quest for finding suitable machine learning methods, and to obtain results comparable to those in the literature with strong statistical stability.

Highlights

Different methods of pattern recognition have been used to estimate the activity of biological molecules
Comparative results for different algorithms used to predict the activity of antimicrobial peptides
We found that the models using Genetic Algorithms (GA) and SAE2 had low variability, and that changing the data in the train/test would result in good performances (Table 4)

Summary

Introduction

Different methods of pattern recognition have been used to estimate the activity of biological molecules. Some of the methods that have been used to predict antimicrobial peptides include Partial Least Squares [2, 3], Artificial Neural Networks [4], Multiple Linear Regression [5, 6], and Support Vector Regression (SVR) [7,8,9], among others. Performance assessment of these methods is typically limited to few metrics obtained with fixed validation sets, measuring the distance of prediction from the real output, but providing little evidence on whether the used methods have found a real correlation. Our work takes a resampling approach, where data are split several times to ensure the statistical robustness of the results

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Assessing the behavior of machine learning methods to predict the activity of antimicrobial peptides

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Revista Facultad de Ingeniería

Lead the way for us

Journal: Revista Facultad de Ingeniería	Publication Date: Dec 31, 2016
License type: CC BY-NC 4.0

Similar Papers

Overall Survival Prognostic Modelling of Non-small Cell Lung Cancer Patients Using Positron Emission Tomography/Computed Tomography Harmonised Radiomics Features: The Quest for the Optimal Machine Learning Algorithm
Mehdi Amini ... Habib Zaidi
Clinical Oncology | VOL. 34
Mehdi Amini, et. al.Mehdi Amini ... Habib Zaidi
03 Dec 2021
Clinical Oncology | VOL. 34

Machine Learning and Network Methods for Biology and Medicine
Lei Chen ... Dandan Li
Computational and Mathematical Methods in Medicine | VOL. 2015
Lei Chen, et. al.Lei Chen ... Dandan Li
01 Jan 2015
Computational and Mathematical Methods in Medicine | VOL. 2015

Genomic prediction in plants: opportunities for ensemble machine learning based approaches.
Muhammad Farooq ... Harm Nijveen
F1000Research | VOL. 11
Muhammad Farooq, et. al.Muhammad Farooq ... Harm Nijveen
10 Jan 2023
F1000Research | VOL. 11

Genomic prediction in plants: opportunities for ensemble machine learning based approaches.
Muhammad Farooq ... Shahid Mansoor
F1000Research | VOL. 11
Muhammad Farooq, et. al.Muhammad Farooq ... Shahid Mansoor
18 Jul 2022
F1000Research | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Assessing the behavior of machine learning methods to predict the activity of antimicrobial peptides

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Revista Facultad de Ingeniería