Protein Model Quality Research Articles

Background: The selection of the most accurate protein model from a set of alternatives is a crucial step in protein structure prediction, in both template-based and ab initio approaches. Scoring functions have been developed which can either return a quality estimate for a single model or derive a score from the information contained in the ensemble of models for a given sequence. Local structural features occurring more frequently in the ensemble have a greater probability of being correct. Within the context of the CASP experiment, these so-called consensus methods have been shown to perform considerably better in selecting good candidate models, but they tend to fail if the best models are far from the dominant structural cluster. In this paper we show that model selection can be improved if both approaches are combined by pre-filtering the models used during the calculation of the structural consensus.

Results: Our recently published QMEAN composite scoring function has been improved by including an all-atom interaction potential term. The preliminary model ranking based on the new QMEAN score is used to select a subset of reliable models against which the structural consensus score is calculated. This scoring function, called QMEANclust, achieves a correlation coefficient between predicted quality score and GDT_TS of 0.9, averaged over the 98 CASP7 targets, and performs significantly better in selecting good models from the ensemble of server models than any other group participating in the quality estimation category of CASP7. Both scoring functions are also benchmarked on the MOULDER test set, consisting of 20 target proteins, each with 300 alternative models generated by MODELLER. QMEAN outperforms all other tested scoring functions operating on individual models, while the consensus method QMEANclust only works properly on decoy sets containing a certain fraction of near-native conformations. We also present a local version of QMEAN for the per-residue estimation of model quality (QMEANlocal) and compare it to a new local consensus-based approach.

Conclusion: Improved model selection is obtained by using a composite scoring function operating on single models to enrich higher-quality models, which are subsequently used to calculate the structural consensus. The performance of consensus-based methods such as QMEANclust depends strongly on the composition and quality of the model ensemble to be analysed. Therefore, performance estimates for consensus methods based on large meta-datasets (e.g. CASP) might overrate their applicability in more realistic modelling situations with smaller sets of models based on individual methods.
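The pre-filtering idea described in this abstract can be illustrated with a minimal Python sketch. It is not the QMEANclust implementation: the function name `prefiltered_consensus`, the `keep_fraction` parameter, and the use of a precomputed pairwise similarity matrix (for example, GDT_TS between models) are assumptions made for illustration only.

```python
import math
from typing import List, Sequence


def prefiltered_consensus(single_scores: Sequence[float],
                          similarity: Sequence[Sequence[float]],
                          keep_fraction: float = 0.5) -> List[float]:
    """Hypothetical sketch: consensus score against a pre-filtered subset.

    single_scores: one quality estimate per model from a single-model
        scoring function (higher is assumed to be better).
    similarity: symmetric matrix of pairwise structural similarity
        between models (assumed to be precomputed).
    keep_fraction: assumed fraction of top-ranked models kept as the
        reference subset.
    """
    n = len(single_scores)
    if n < 2:
        raise ValueError("need at least two models")
    # Keep at least two reference models so every model has a partner.
    n_keep = min(n, max(2, math.ceil(n * keep_fraction)))
    # Step 1: preliminary ranking by the single-model score.
    ranked = sorted(range(n), key=lambda i: single_scores[i], reverse=True)
    reference = ranked[:n_keep]
    # Step 2: each model's consensus score is its mean similarity to the
    # reference subset, excluding the comparison of a model with itself.
    consensus = []
    for i in range(n):
        others = [similarity[i][j] for j in reference if j != i]
        consensus.append(sum(others) / len(others))
    return consensus


# Toy example: models 0 and 1 resemble each other, as do models 2 and 3.
scores = [0.9, 0.8, 0.2, 0.1]
sim = [
    [1.0, 0.9, 0.3, 0.3],
    [0.9, 1.0, 0.3, 0.3],
    [0.3, 0.3, 1.0, 0.9],
    [0.3, 0.3, 0.9, 1.0],
]
result = prefiltered_consensus(scores, sim, keep_fraction=0.5)
```

In this toy case the reference subset consists of models 0 and 1, so models 2 and 3 receive low consensus scores even though they agree with each other. Without pre-filtering, all four models would score similarly, because each has exactly one close partner.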

Background: Reduced representations of proteins have played a key role in the study of protein folding. Many such models are available, with differing levels of representation detail. Although the usefulness of many such models for structural bioinformatics applications has been demonstrated in recent years, there are few intermediate-resolution models endowed with an energy model capable, for instance, of detecting native or native-like structures among decoy sets. The aim of the present work is to provide a discrete empirical potential, suitable for protein model quality assessment, for a reduced protein model termed here PC2CA, because it employs a PseudoCovalent structure with only 2 Centers of interaction per Amino acid.

Results: All protein structures in the set top500H have been converted to reduced form. The distributions of pseudobonds, pseudoangles, pseudodihedrals and distances between centers of interaction have been converted into potentials of mean force. A suitable reference distribution has been defined for non-bonded interactions which takes into account excluded volume effects and protein finite size. The correlation between adjacent main chain pseudodihedrals has been converted into an additional energetic term which is able to account for cooperative effects in secondary structure elements. Local energy surface exploration is performed in order to increase the robustness of the energy function.

Conclusion: The model and the energy definition proposed have been tested on all the multiple decoy sets in the Decoys'R'us database. The energetic model is able to recognize, for almost all sets, native-like structures (RMSD less than 2.0 Å). These results and those obtained in the blind CASP7 quality assessment experiment suggest that the model compares well with scoring potentials with finer granularity and could be useful for fast exploration of conformational space. Parameters are available at the url: .
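The conversion of observed distributions into potentials of mean force is commonly done with the inverse Boltzmann relation, E = -kT ln(p_obs / p_ref). The Python sketch below shows that generic relation on binned counts. It is not the PC2CA parameterisation: the function name, the pseudocount smoothing, and the flat reference histogram in the example are assumptions for illustration, whereas the abstract describes a reference distribution that accounts for excluded volume and finite protein size.

```python
import math
from typing import List, Sequence


def potential_of_mean_force(observed_counts: Sequence[float],
                            reference_counts: Sequence[float],
                            kT: float = 1.0,
                            pseudocount: float = 1.0) -> List[float]:
    """Hypothetical sketch: per-bin energy from the inverse Boltzmann relation.

    observed_counts: histogram of a feature (e.g. a pseudodihedral or a
        distance) collected from known structures.
    reference_counts: histogram expected in the absence of specific
        interactions (its definition here is an assumption).
    pseudocount: assumed smoothing term that avoids log(0) in empty bins.
    """
    if len(observed_counts) != len(reference_counts):
        raise ValueError("histograms must have the same number of bins")
    n_bins = len(observed_counts)
    obs_total = sum(observed_counts) + pseudocount * n_bins
    ref_total = sum(reference_counts) + pseudocount * n_bins
    energies = []
    for obs, ref in zip(observed_counts, reference_counts):
        p_obs = (obs + pseudocount) / obs_total
        p_ref = (ref + pseudocount) / ref_total
        # Bins observed more often than expected get negative energy.
        energies.append(-kT * math.log(p_obs / p_ref))
    return energies


# Toy example: three bins, flat reference, first bin strongly favoured.
energies = potential_of_mean_force([30, 10, 0], [10, 10, 10])
```

A model could then be scored by summing the bin energies of its observed features, with lower totals indicating more native-like geometry under these assumptions.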

Articles published on Protein Model Quality

QMEANclust: estimation of protein model quality by combining a composite scoring function with structural density information

Evaluating the absolute quality of a single protein model using structural features and support vector machines

Validation of protein models by a neural network approach

Protein model quality assessment prediction by combining fragment comparisons and a consensus Cα contact potential

Molecular dynamics simulations and membrane protein structure quality

An information theoretic approach for improving data driven prediction of protein model quality

Scoring predictive models using a reduced representation of proteins: model and energy definition

A method for evaluating the structural quality of protein models by using higher-order φ–ψ pairs scoring

Why protein R-factors are so large: a self-consistent analysis.
