Recent Progress in Machine Learning-Based Methods for Protein Fold Recognition.

Leyi Wei,Quan Zou

doi:10.3390/ijms17122118

Abstract

Knowledge on protein folding has a profound impact on understanding the heterogeneity and molecular function of proteins, further facilitating drug design. Predicting the 3D structure (fold) of a protein is a key problem in molecular biology. Determination of the fold of a protein mainly relies on molecular experimental methods. With the development of next-generation sequencing techniques, the discovery of new protein sequences has been rapidly increasing. With such a great number of proteins, the use of experimental techniques to determine protein folding is extremely difficult because these techniques are time consuming and expensive. Thus, developing computational prediction methods that can automatically, rapidly, and accurately classify unknown protein sequences into specific fold categories is urgently needed. Computational recognition of protein folds has been a recent research hotspot in bioinformatics and computational biology. Many computational efforts have been made, generating a variety of computational prediction methods. In this review, we conduct a comprehensive survey of recent computational methods, especially machine learning-based methods, for protein fold recognition. This review is anticipated to assist researchers in their pursuit to systematically understand the computational recognition of protein folds.

Highlights

Understanding how proteins adopt their 3D structure remains one of the greatest challenges in science
Determination of protein structure relies on traditional experimental methods, such as X-ray crystallography and nuclear magnetic resonance spectroscopy
We have systematically reviewed the recent progress in machine learning-based protein fold recognition methods

Summary

Introduction

Understanding how proteins adopt their 3D structure remains one of the greatest challenges in science. To determine the optimal alignments, scoring functions are usually used as measures to evaluate the similarity between the profiles derived from a query protein and those of template proteins with known structures. Jaroszewski et al [1] developed a protein recognition method called Fold and Function Assignment System (FFAS) by using a profile-profile alignment strategy without using any structural information. Template-free methods seek to build models and accurately predict protein structures solely based on amino acid sequences rather than on known structural proteins as templates. Machine learning aims to build a prediction model by learning the differences between different protein fold categories and use the learned model to automatically assign a query protein to a specific protein fold class This approach is more efficient for large-scale predictions and can examine a large number of promising candidates for further experimental validation. We evaluate and compare the recognition performance of existing methods used in the last 10 years on a benchmark dataset

Databases

SCOP and SCOP2

Ensemble Classifier-Based Methods

Comparisons with Different Methods on Benchmark Dataset

Findings

Conclusions and Perspectives

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Molecular Sciences	Publication Date: Dec 16, 2016
Citations: 77	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Recent Progress in Machine Learning-Based Methods for Protein Fold Recognition.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Molecular Sciences

Lead the way for us

Similar Papers

Computational Methods for Protein Structure Prediction and Fold Recognition
Iwona A Cymerman ... Marcin Pawłowski
-
Iwona A Cymerman, et. al.Iwona A Cymerman ... Marcin Pawłowski
01 Jan 2008
01 Jan 2008

Recent Trends in Machine Learning-based Protein Fold Recognition Methods
-
Biointerface Research in Applied Chemistry | VOL. 11
--
10 Dec 2020
Biointerface Research in Applied Chemistry | VOL. 11

SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition
Iain Melvin ... William Stafford Noble
BMC Bioinformatics | VOL. 8
Iain Melvin, et. al.Iain Melvin ... William Stafford Noble
01 May 2007
BMC Bioinformatics | VOL. 8

The recognition of multi-class protein folds by adding average chemical shifts of secondary structure elements
Zhenxing Feng ... Muhammad Aqeel Ashraf
Saudi Journal of Biological Sciences | VOL. 23
Zhenxing Feng, et. al.Zhenxing Feng ... Muhammad Aqeel Ashraf
11 Dec 2015
Saudi Journal of Biological Sciences | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recent Progress in Machine Learning-Based Methods for Protein Fold Recognition.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Molecular Sciences