Abstract

Predicting the functional consequences of single point mutations has relevance to protein function annotation and to clinical analysis/diagnosis. We developed and tested Packpred that makes use of a multi-body clique statistical potential in combination with a depth-dependent amino acid substitution matrix (FADHM) and positional Shannon entropy to predict the functional consequences of point mutations in proteins. Parameters were trained over a saturation mutagenesis data set of T4-lysozyme (1,966 mutations). The method was tested over another saturation mutagenesis data set (CcdB; 1,534 mutations) and the Missense3D data set (4,099 mutations). The performance of Packpred was compared against those of six other contemporary methods. With MCC values of 0.42, 0.47, and 0.36 on the training and testing data sets, respectively, Packpred outperforms all methods in all data sets, with the exception of marginally underperforming in comparison to FADHM in the CcdB data set. A meta server analysis was performed that chose best performing methods of wild-type amino acids and for wild-type mutant amino acid pairs. This led to an increase in the MCC value of 0.40 and 0.51 for the two meta predictors, respectively, on the Missense3D data set. We conjecture that it is possible to improve accuracy with better meta predictors as among the seven methods compared, at least one method or another is able to correctly predict ∼99% of the data.

Highlights

  • Amino acid substitutions could affect protein stability, alter/impair its function, and possibly lead to disease conditions (Zhang et al, 2012)

  • In the SIFT method (Ng and Henikoff, 2003), mutational effect prediction is made based on a Packpred: Predicting Effects of Mutations customized position-specific substitution matrix (PSSM), constructed using PSI-BLAST (Altschul et al, 1997) and MOTIF finder (Smith et al, 1990) to identify conserved local sequence regions

  • SDM (Pandurangan et al, 2017), which does not rely on machine learning, constructs an environment-specific amino acid substitution matrix based on observed substitutions in evolutionary time

Read more

Summary

INTRODUCTION

Amino acid substitutions could affect protein stability, alter/impair its function, and possibly lead to disease conditions (Zhang et al, 2012). Polyphen (Adzhubei et al, 2010) is a hybrid method that combines sequence and structural features to predict the effect of a mutation. It uses an improved version of PSSM, information from the Pfam database, and structural features such as accessible surface area and volume of an amino acid to make a prediction. SuSPect (Yates et al, 2014) is another hybrid-based method that uses PSSMs and Pfam domain profiles (Finn et al, 2014) It includes information from protein–protein interaction networks and searches in the database for known functional annotations of a mutated position. The lesser the variation, the greater is the functional importance of the residue

MATERIALS AND METHODS
RESULTS
Method
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call