Consensus Data Mining (CDM) Protein Secondary Structure Prediction Server: Combining GOR V and Fragment Database Mining (FDM)

Haitao Cheng,Taner Z Sen,Andrzej Kloczkowski,Robert L Jernigan

doi:10.1093/bioinformatics/btm379

Abstract

One of the challenges in protein secondary structure prediction is to overcome the cross-validated 80% prediction accuracy barrier. Here, we propose a novel approach to surpass this barrier. Instead of using a single algorithm that relies on a limited data set for training, we combine two complementary methods having different strengths: Fragment Database Mining (FDM) and GOR V. FDM harnesses the availability of the known protein structures in the Protein Data Bank and provides highly accurate secondary structure predictions when sequentially similar structural fragments are identified. In contrast, the GOR V algorithm is based on information theory, Bayesian statistics, and PSI-BLAST multiple sequence alignments to predict the secondary structure of residues inside a sliding window along a protein chain. A combination of these two different methods benefits from the large number of structures in the PDB and significantly improves the secondary structure prediction accuracy, resulting in Q3 ranging from 67.5 to 93.2%, depending on the availability of highly similar fragments in the Protein Data Bank.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Consensus Data Mining (CDM) Protein Secondary Structure Prediction Server: Combining GOR V and Fragment Database Mining (FDM)

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Journal: Bioinformatics	Publication Date: Jul 27, 2007
Citations: 44

Similar Papers

Data Mining for Protein Secondary Structure Prediction
Haitao Cheng ... Robert L Jernigan
-
Haitao Cheng, et. al.Haitao Cheng ... Robert L Jernigan
01 Jan 2009
01 Jan 2009

Accuracy of Identical Subsequences Based Protein Secondary Structure Prediction
Faruk Berat Akcesme ... Muhamed Adilovic
Southeast Europe Journal of Soft Computing | VOL. 6
Faruk Berat Akcesme, et. al.Faruk Berat Akcesme ... Muhamed Adilovic
24 May 2017
Southeast Europe Journal of Soft Computing | VOL. 6

Prediction of secondary structural content of proteins from their amino acid composition alone. II. The paradox with secondary structural class.
Frank Eisenhaber ... Cornelius Frömmel
Proteins | VOL. 25
Frank Eisenhaber, et. al.Frank Eisenhaber ... Cornelius Frömmel
01 Jun 1996
Proteins | VOL. 25

Accurate prediction of protein secondary structure and solvent accessibility by consensus combiners of sequence and structure information.
Gianluca Pollastri ... Catherine Mooney
BMC Bioinformatics | VOL. 8
Gianluca Pollastri, et. al.Gianluca Pollastri ... Catherine Mooney
14 Jun 2007
BMC Bioinformatics | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Consensus Data Mining (CDM) Protein Secondary Structure Prediction Server: Combining GOR V and Fragment Database Mining (FDM)

Abstract

Talk to us

Similar Papers

More From: Bioinformatics