A composite score for predicting errors in protein structure models

David Eramian,Marc A Marti‐Renom,Andrej Sali,Francisco Melo,Damien Devos,Min‐Yi Shen

doi:10.1110/ps.062095806

Abstract

Reliable prediction of model accuracy is an important unsolved problem in protein structure modeling. To address this problem, we studied 24 individual assessment scores, including physics-based energy functions, statistical potentials, and machine learning-based scoring functions. Individual scores were also used to construct approximately 85,000 composite scoring functions using support vector machine (SVM) regression. The scores were tested for their abilities to identify the most native-like models from a set of 6000 comparative models of 20 representative protein structures. Each of the 20 targets was modeled using a template of <30% sequence identity, corresponding to challenging comparative modeling cases. The best SVM score outperformed all individual scores by decreasing the average RMSD difference between the model identified as the best of the set and the model with the lowest RMSD (DeltaRMSD) from 0.63 A to 0.45 A, while having a higher Pearson correlation coefficient to RMSD (r=0.87) than any other tested score. The most accurate score is based on a combination of the DOPE non-hydrogen atom statistical potential; surface, contact, and combined statistical potentials from MODPIPE; and two PSIPRED/DSSP scores. It was implemented in the SVMod program, which can now be applied to select the final model in various modeling problems, including fold assignment, target-template alignment, and loop modeling.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A composite score for predicting errors in protein structure models

Abstract

Talk to us

Similar Papers

More From: Protein Science

Lead the way for us

Journal: Protein Science	Publication Date: Jul 1, 2006
Citations: 224

Similar Papers

Family Functioning and Child Psychopathology: Individual versus Composite Family Scores
Jolanda J J P Mathijssen ... Eric E J De Bruyn
Family Relations | VOL. 46
Jolanda J J P Mathijssen, et. al.Jolanda J J P Mathijssen ... Eric E J De Bruyn
01 Jul 1997
Family Relations | VOL. 46

TB-IECS: an accurate machine learning-based scoring function for virtual screening
Xujun Zhang ... Yu Kang
Journal of Cheminformatics | VOL. 15
Xujun Zhang, et. al.Xujun Zhang ... Yu Kang
04 Jul 2023
Journal of Cheminformatics | VOL. 15

A Small Step Toward Generalizability: Training a Machine Learning Scoring Function for Structure-Based Virtual Screening.
Jack Scantlebury ... Charlotte M Deane
Journal of chemical information and modeling | VOL. 63
Jack Scantlebury, et. al.Jack Scantlebury ... Charlotte M Deane
11 May 2023
Journal of chemical information and modeling | VOL. 63

Fault Classification of Low-Speed Bearings Based on Support Vector Machine for Regression and Genetic Algorithms Using Acoustic Emission
Henry Ogbemudia Omoregbee ... P Stephan Heyns
Journal of Vibration Engineering & Technologies | VOL. 7
Henry Ogbemudia Omoregbee, et. al.Henry Ogbemudia Omoregbee ... P Stephan Heyns
12 Jun 2019
Journal of Vibration Engineering & Technologies | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A composite score for predicting errors in protein structure models

Abstract

Talk to us

Similar Papers

More From: Protein Science