AllesTM: predicting multiple structural features of transmembrane proteins

Peter Hönigschmid, Martina Weigl, Dmitrij Frishman, Stephan Breimann

doi:10.1186/s12859-020-03581-8

Abstract

Background

This study is motivated by three considerations: a) the physico-chemical properties of transmembrane (TM) proteins are distinctly different from those of globular proteins, necessitating the development of specialized structure prediction techniques; b) for many structural features, no specialized predictors for TM proteins are available at all; and c) deep learning algorithms allow the feature engineering process to be automated and thus facilitate the development of multi-target methods for predicting several protein properties at once.

Results

We present AllesTM, an integrated tool to predict almost all structural features of transmembrane proteins that can be extracted from atomic coordinate data. It blends several machine learning algorithms: random forests and gradient boosting machines, convolutional neural networks in their original form as well as those enhanced by dilated convolutions and residual connections, and, finally, long short-term memory architectures. AllesTM outperforms other available methods in predicting residue depth in the membrane, flexibility, topology, and relative solvent accessibility in the bound state, while in the prediction of torsion angles, secondary structure, and monomer relative solvent accessibility it lags only slightly behind the currently leading technique, SPOT-1D. High accuracy on a multitude of prediction targets and easy installation make AllesTM a one-stop shop for many typical problems in the structural bioinformatics of transmembrane proteins.

Conclusions

In addition to presenting a highly accurate prediction method and eliminating the need to install and maintain many different software tools, we also provide a comprehensive overview of the impact of different machine learning algorithms and parameter choices on the prediction performance.

AllesTM is freely available at https://github.com/phngs/allestm.

Highlights

Starting with the seminal work of Qian and Sejnowski [1], only the sky seems to be the limit for the application of machine learning methods to sequence-based protein structure prediction.
In addition to presenting a highly accurate prediction method and eliminating the need to install and maintain many different software tools, we provide a comprehensive overview of the impact of different machine learning algorithms and parameter choices on the prediction performance.
While the z-coordinates are the only target where no other method is publicly available for comparison, several conclusions regarding the employed machine learning algorithm can be drawn from the performance overview shown in S1 Table.

Summary

Introduction

Starting with the seminal work of Qian and Sejnowski [1], only the sky seems to be the limit for the application of machine learning methods to sequence-based protein structure prediction. One specific advantage of this group of methods is that they automate the feature engineering process and thereby eliminate, or at least significantly alleviate, what is arguably the most time-consuming step in the development of bioinformatics prediction algorithms. This, in turn, opens up the possibility of developing multi-target prediction methods, i.e. methods that predict a whole array of protein properties directly from input sequences or evolutionary profiles. This study is motivated by three considerations: a) the physico-chemical properties of transmembrane (TM) proteins are distinctly different from those of globular proteins, necessitating the development of specialized structure prediction techniques; b) for many structural features, no specialized predictors for TM proteins are available at all; and c) deep learning algorithms allow the feature engineering process to be automated and facilitate the development of multi-target methods for predicting several protein properties at once.
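To make the idea of multi-target prediction more concrete, the following is a minimal sketch in plain Python, not the AllesTM implementation. It assumes a one-hot encoded sequence as input (the paper also mentions evolutionary profiles), a small shared stack of dilated convolutions with residual connections, and one per-residue output head per target. All function names, the head names, the layer sizes, and the output dimensions are hypothetical choices made for illustration, and the weights are random rather than trained, so the outputs are meaningless scores with the right shapes.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def one_hot(sequence):
    """Encode each residue as a 20-dimensional one-hot vector."""
    return [[1.0 if aa == res else 0.0 for aa in AMINO_ACIDS] for res in sequence]


def dilated_conv1d(x, weights, dilation):
    """Zero-padded ("same") 1D convolution along the sequence axis.

    x:       list of L positions, each a list of C_in channel values
    weights: weights[k][i][o] for kernel tap k, input channel i, output channel o
    """
    length, kernel = len(x), len(weights)
    c_in, c_out = len(weights[0]), len(weights[0][0])
    center = kernel // 2
    out = []
    for pos in range(length):
        acc = [0.0] * c_out
        for k in range(kernel):
            src = pos + (k - center) * dilation  # taps are spaced `dilation` apart
            if 0 <= src < length:  # positions outside the sequence count as zero
                for i in range(c_in):
                    for o in range(c_out):
                        acc[o] += x[src][i] * weights[k][i][o]
        out.append(acc)
    return out


def relu(x):
    return [[max(0.0, v) for v in row] for row in x]


def residual_block(x, weights, dilation):
    """Residual connection: y = x + relu(conv(x)). Needs C_in == C_out."""
    y = relu(dilated_conv1d(x, weights, dilation))
    return [[a + b for a, b in zip(rx, ry)] for rx, ry in zip(x, y)]


def random_weights(rng, kernel, c_in, c_out, scale=0.1):
    """Untrained placeholder weights (assumption: a real model would learn these)."""
    return [[[rng.uniform(-scale, scale) for _ in range(c_out)]
             for _ in range(c_in)] for _ in range(kernel)]


# Hypothetical targets and output sizes, chosen only for illustration.
HEADS = {"secondary_structure": 3, "topology": 4, "rsa": 1, "z_coordinate": 1}


def predict_all(sequence, seed=0, hidden=8):
    """Shared encoder followed by one per-residue head per target."""
    rng = random.Random(seed)
    x = dilated_conv1d(one_hot(sequence), random_weights(rng, 3, 20, hidden), 1)
    for dilation in (1, 2, 4):  # growing dilation widens the receptive field
        x = residual_block(x, random_weights(rng, 3, hidden, hidden), dilation)
    # Each head is a kernel-size-1 convolution, i.e. a per-residue linear layer.
    return {name: dilated_conv1d(x, random_weights(rng, 1, hidden, n_out), 1)
            for name, n_out in HEADS.items()}


if __name__ == "__main__":
    predictions = predict_all("MKTLLVAGAVL")
    for name, values in predictions.items():
        print(name, len(values), "residues x", len(values[0]), "outputs")
```

The point of the sketch is the shape of the computation: the sequence is encoded once, and every target reuses the same per-residue features, so adding a further structural feature only requires a further head rather than a separately engineered predictor.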

Journal: BMC Bioinformatics	Publication Date: Jun 12, 2020
Citations: 3	License type: open-access
