BCov: a method for predicting β-sheet topology using sparse inverse covariance estimation and integer programming

Castrense Savojardo, Rita Casadio, Pier Luigi Martelli, Piero Fariselli

doi:10.1093/bioinformatics/btt555


Open Access

https://doi.org/10.1093/bioinformatics/btt555


Journal: Bioinformatics	Publication Date: Sep 23, 2013
Citations: 27	License type: cc-by

Affiliation: University of Bologna

Abstract

Prediction of protein residue contacts, even at the coarse-grain level, can help in finding solutions to the protein structure prediction problem. Unlike α-helices, which are locally stabilized, β-sheets result from pairwise hydrogen bonding of two or more disjoint regions of the protein backbone. The problem of predicting contacts among β-strands in proteins has been addressed by several supervised computational approaches. Recently, prediction of residue contacts based on correlated mutations has greatly improved and now allows the prediction of 3D protein structures. In this article, we describe BCov, the first unsupervised method to predict β-sheet topology starting from the protein sequence and its secondary structure. BCov uses sparse inverse covariance estimation to define β-strand partner scores. An optimization based on integer programming is then carried out to predict the β-sheet connectivity. When tested on the prediction of β-strand pairing, BCov achieves average Matthews Correlation Coefficient (MCC) and F1 values of 0.56 and 0.61, respectively, on a non-redundant dataset of 916 protein chains known at atomic resolution. Our approach compares well with the state-of-the-art methods trained so far for this specific task. The method is freely available under the General Public License at http://biocomp.unibo.it/savojard/bcov/bcov-1.0.tar.gz. The new dataset BetaSheet1452 can be downloaded at http://biocomp.unibo.it/savojard/bcov/BetaSheet1452.dat.
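The two figures of merit quoted in the abstract can be illustrated with a minimal sketch. It assumes that each unordered pair of strands in a chain is treated as one binary instance (paired or not paired); the function name `pairing_metrics` and its arguments are hypothetical, and the exact evaluation protocol used for BCov is not described in this summary.

```python
from math import sqrt


def pairing_metrics(predicted, observed, n_strands):
    """Return (MCC, F1) for predicted vs. observed strand pairs.

    Assumption of this sketch: every unordered pair of distinct strands
    counts as one binary classification instance.
    """
    pred = {frozenset(p) for p in predicted}
    obs = {frozenset(p) for p in observed}
    total = n_strands * (n_strands - 1) // 2  # all possible strand pairs

    tp = len(pred & obs)   # pairs predicted and observed
    fp = len(pred - obs)   # pairs predicted but not observed
    fn = len(obs - pred)   # pairs observed but not predicted
    tn = total - tp - fp - fn

    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    f1 = 2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 0.0
    return mcc, f1


# Toy chain with 4 strands: one correct pair, one spurious, one missed.
mcc, f1 = pairing_metrics([(0, 1), (1, 2)], [(0, 1), (2, 3)], n_strands=4)
print(round(mcc, 2), round(f1, 2))  # 0.25 0.5
```

Averaging these per-chain values over a dataset would give dataset-level scores of the kind reported above.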

Highlights

In this article, we describe BCov, which is the first unsupervised method to predict the β-sheet topology starting from the protein sequence and its secondary structure.
β-Sheets are widespread motifs of local structure found in over 80% of the protein structures presently available in the Protein Data Bank. β-Sheets are generated by the pairing of two or more β-strands held together by characteristic patterns of hydrogen bonds running in a parallel or antiparallel fashion (Zhang and Kim, 2000).
We describe BCov, a new approach for β-sheet topology prediction based on sparse inverse covariance estimation and integer programming.
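The optimization step can be pictured with a toy sketch: given a partner score for every strand pair, select the set of pairings with the highest total score subject to a connectivity constraint. Everything below is an assumption made for illustration. The function `best_pairing`, the score values, and the constraint that a strand has at most two partners are hypothetical, and exhaustive search stands in for an integer-programming solver; BCov's actual formulation is not given in this summary.

```python
from itertools import combinations


def best_pairing(scores, max_partners=2):
    """Exhaustively pick the subset of strand pairs with maximal total score.

    scores: dict mapping (i, j), i < j, to a strand partner score.
    Constraint (assumed for this sketch): no strand takes part in more
    than max_partners pairings.
    """
    pairs = sorted(scores)
    best_total, best_subset = 0.0, ()
    for k in range(1, len(pairs) + 1):
        for subset in combinations(pairs, k):
            # Count how many partners each strand has in this candidate.
            degree = {}
            for i, j in subset:
                degree[i] = degree.get(i, 0) + 1
                degree[j] = degree.get(j, 0) + 1
            if max(degree.values()) > max_partners:
                continue  # infeasible candidate
            total = sum(scores[p] for p in subset)
            if total > best_total:
                best_total, best_subset = total, subset
    return list(best_subset), best_total


# Hypothetical scores for a chain with 4 strands.
toy_scores = {
    (0, 1): 0.9, (0, 2): 0.1, (0, 3): 0.4,
    (1, 2): 0.8, (1, 3): -0.2, (2, 3): 0.7,
}
chosen, total = best_pairing(toy_scores)
print(chosen)  # [(0, 1), (0, 3), (1, 2), (2, 3)]
```

The search is exponential in the number of candidate pairs, so it is only usable on tiny inputs; an integer-programming solver addresses the same kind of constrained maximization at realistic sizes.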

Summary

INTRODUCTION

β-Sheets are widespread motifs of local structure found in over 80% of the protein structures presently available in the Protein Data Bank (http://www.rcsb.org/pdb/home/home.do). β-Sheets are generated by the pairing of two or more β-strands held together by characteristic patterns of hydrogen bonds running in a parallel or antiparallel fashion (Zhang and Kim, 2000). Cheng and Baldi (2005) pioneered the idea of predicting β-sheet topologies when the protein secondary structure is known and set the standard for this type of task. Their method, BetaPro, is based on a 2D-recursive neural network (Baldi and Pollastri, 2003) trained to predict pairing probabilities of interstrand β-residue pairs. Some powerful methods based on the extraction of direct coupling information from multiple sequence alignments (MSAs) have been introduced to predict protein contacts both in globular (Cocco et al., 2013; Ekeberg et al., 2013; Jones et al., 2012; Marks et al., 2011; Morcos et al., 2001; Weigt et al., 2009) and membrane proteins (Hopf et al., 2012; Nugent and Jones, 2012).

The BetaSheet916 dataset

The BetaSheet1452 dataset

CASP 2010 dataset

MSA construction

BCov general description

Computing the residue contact propensity with PSICOV

Method implementation

Measures of performance

PSICOV performance on β-residue contacts

BCov performance on β-residue contacts

Method

BCov performance at strand level

Performance on the CASP 2010 dataset

Performance on the new BetaSheet1452 dataset


