ProteinUnet-An efficient alternative to SPIDER3-single for sequence-based prediction of protein secondary structures.

Krzysztof Kotowski,Tomasz Smolarczyk,Katarzyna Stapor,Irena Roterman‐Konieczna

doi:10.1002/jcc.26432

Abstract

Predicting protein function and structure from sequence remains an unsolved problem in bioinformatics. The best performing methods rely heavily on evolutionary information from multiple sequence alignments, which means their accuracy deteriorates for sequences with a few homologs, and given the increasing sequence database sizes requires long computation times. Here, a single‐sequence‐based prediction method is presented, called ProteinUnet, leveraging an U‐Net convolutional network architecture. It is compared to SPIDER3‐Single model, based on long short‐term memory‐bidirectional recurrent neural networks architecture. Both methods achieve similar results for prediction of secondary structures (both three‐ and eight‐state), half‐sphere exposure, and contact number, but ProteinUnet has two times fewer parameters, 17 times shorter inference time, and can be trained 11 times faster. Moreover, ProteinUnet tends to be better for short sequences and residues with a low number of local contacts. Additionally, the method of loss weighting is presented as an effective way of increasing accuracy for rare secondary structures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of computational chemistry	Publication Date: Oct 15, 2020
Citations: 19	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

ProteinUnet-An efficient alternative to SPIDER3-single for sequence-based prediction of protein secondary structures.

Abstract

Talk to us

Similar Papers

More From: Journal of computational chemistry

Lead the way for us

Similar Papers

Single-sequence-based prediction of protein secondary structures and solvent accessibility by deep whole-sequence learning.
Rhys Heffernan ... Kuldip Paliwal
Journal of Computational Chemistry | VOL. 39
Rhys Heffernan, et. al.Rhys Heffernan ... Kuldip Paliwal
05 Oct 2018
Journal of Computational Chemistry | VOL. 39

Data Mining for Protein Secondary Structure Prediction
Haitao Cheng ... Taner Z Sen
-
Haitao Cheng, et. al.Haitao Cheng ... Taner Z Sen
01 Jan 2009
01 Jan 2009

Secondary and Tertiary Structure Prediction of Proteins: A Bioinformatic Approach
Minu Kesheri ... Rajeshwar Prasad Sinha
-
Minu Kesheri, et. al.Minu Kesheri ... Rajeshwar Prasad Sinha
30 Nov 2014
30 Nov 2014

Improved Protein Secondary Structure Prediction Using a Intelligent HSVM Method with a New Encoding Scheme
Haifeng Sui ... Lijun Wang
International Journal of Advancements in Computing Technology | VOL. 3
Haifeng Sui , et. al.Haifeng Sui ... Lijun Wang
30 Apr 2011
International Journal of Advancements in Computing Technology | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ProteinUnet-An efficient alternative to SPIDER3-single for sequence-based prediction of protein secondary structures.

Abstract

Talk to us

Similar Papers

More From: Journal of computational chemistry