Bayesian Protein Secondary Structure Prediction With Near-Optimal Segmentations

Zafer Aydin, Yucel Altunbasak, Hakan Erdogan

doi:10.1109/tsp.2007.894404

Abstract

Secondary structure prediction is an invaluable tool in determining the 3-D structure and function of proteins. Typically, protein secondary structure prediction methods suffer from low accuracy in beta-strand predictions, where nonlocal interactions play a significant role. There is a considerable need to model such long-range interactions that contribute to the stabilization of a protein molecule. In this paper, we introduce an alternative decoding technique for the hidden semi-Markov model (HSMM) originally employed in the BSPSS algorithm, and further developed in the IPSSP algorithm. The proposed method is based on the N-best paradigm, where a set of most likely segmentations is computed. To generate suboptimal segmentations (i.e., alternative prediction sequences), we developed two N-best search algorithms. The first one is an A* stack decoder algorithm that extends paths (or hypotheses) by one symbol at each iteration. The second algorithm locally keeps the end positions of the highest scoring K previous segments and performs backtracking. Both algorithms employ the hidden semi-Markov model described in Aydin et al. [5], and use Viterbi scoring to compute the N-best list. The availability of near-optimal segmentations and the utilization of the Viterbi scoring enable the sequences to be rescored using more complex dependency models that characterize nonlocal interactions in beta-sheets. After the score update, one can either keep the segmentations to be employed in 3-D structure prediction or predict the secondary structure by applying a weighted voting procedure to a set of top scoring M ≥ 1 segmentations. The accuracy measures of the N-best method when used to predict the secondary structure are shown to be comparable to or better than those of the classical Viterbi decoder (MAP estimator), tested under the single-sequence condition.
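The weighted voting step over the top scoring M segmentations can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `weighted_vote`, the one-letter labels (H, E, L), and the choice of weights (exponentiated log scores relative to the best candidate) are assumptions made for illustration only.

```python
import math
from collections import defaultdict


def weighted_vote(segmentations, log_scores):
    """Combine M candidate label strings into one per-residue consensus.

    segmentations: list of equal-length strings over labels such as H, E, L.
    log_scores:    one log score per candidate (higher is better).
    The weighting scheme below is an assumption, not taken from the paper.
    """
    if not segmentations or len(segmentations) != len(log_scores):
        raise ValueError("need one score per segmentation")
    length = len(segmentations[0])
    if any(len(seg) != length for seg in segmentations):
        raise ValueError("all segmentations must have the same length")

    # Turn log scores into positive weights; subtracting the maximum
    # avoids underflow when the scores are large negative numbers.
    best = max(log_scores)
    weights = [math.exp(score - best) for score in log_scores]

    consensus = []
    for position in range(length):
        tally = defaultdict(float)
        for seg, weight in zip(segmentations, weights):
            tally[seg[position]] += weight
        # Ties are broken alphabetically so the result is deterministic.
        consensus.append(max(sorted(tally), key=tally.get))
    return "".join(consensus)


if __name__ == "__main__":
    candidates = ["HHHLLEEE", "HHLLLEEE", "HHHLLLEE"]
    scores = [-10.0, -10.5, -11.0]
    print(weighted_vote(candidates, scores))
```

With M = 1 the vote simply returns the single top segmentation, which corresponds to ordinary Viterbi decoding when no rescoring is applied.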
When no rescoring is applied, the stack decoder algorithm with sufficiently large M improves the overall sensitivity measure (Q3) of the Viterbi algorithm by 1.1%. At the same M value, the N-best Viterbi algorithm improves the Q3 measure by 0.25% as well as the sensitivity measures specific for each secondary structure type (Q_α^obs, Q_β^obs, Q_L^obs). When the sequences are rescored using the posterior probability distribution computed by the posterior decoding algorithm (MPM estimator), N-best Viterbi improves the Q3 measure of the Viterbi algorithm by 2.6%. The rescored N-best list approach also enables us to generate suboptimal segmentations that are valid sequences (i.e., realizable from the hidden semi-Markov model). Although the N-best algorithms and the score update procedure brought significant improvements over the Viterbi algorithm, they were not able to outperform the posterior decoding algorithm in the single-sequence condition. Further improvements in the prediction accuracy should be possible with the incorporation of sophisticated models for nonlocal interactions and other physical constraints that stabilize the overall structure of a protein.
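For readers unfamiliar with the accuracy measures quoted above, the following sketch shows how Q3 and the per-class sensitivity measures are commonly computed from an observed and a predicted label string. It assumes the usual definitions (Q3 as the fraction of residues predicted correctly, and the per-class measure as the fraction of observed residues of one class that are predicted correctly); the function names and the H/E/L label alphabet are illustrative and not taken from the paper.

```python
def q3(observed, predicted):
    """Fraction of residues whose predicted label matches the observed one."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("sequences must be non-empty and of equal length")
    correct = sum(1 for o, p in zip(observed, predicted) if o == p)
    return correct / len(observed)


def q_obs(observed, predicted, label):
    """Per-class sensitivity: correct residues of `label` / observed residues of `label`.

    Returns None when the observed sequence contains no residue of that class.
    """
    if len(observed) != len(predicted):
        raise ValueError("sequences must be of equal length")
    total = sum(1 for o in observed if o == label)
    if total == 0:
        return None
    correct = sum(1 for o, p in zip(observed, predicted) if o == label and p == label)
    return correct / total


if __name__ == "__main__":
    observed = "HHHHEEELLL"
    predicted = "HHHLEEELLH"
    print(q3(observed, predicted))          # 0.8
    print(q_obs(observed, predicted, "H"))  # 0.75
    print(q_obs(observed, predicted, "E"))  # 1.0
```

In practice these measures are reported as percentages averaged over a test set, so a stated gain such as 1.1% refers to percentage points of Q3.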

Journal: IEEE Transactions on Signal Processing
Publication Date: Jul 1, 2007
Citations: 44
