Abstract

Protein concentrations depend not only on the mRNA level, but also on the translation rate and the degradation rate. Prediction of mRNA's translation rate would provide valuable information for in-depth understanding of the translation mechanism and dynamic proteome. In this study, we developed a new computational model to predict the translation rate, featured by (1) integrating various sequence-derived and functional features, (2) applying the maximum relevance & minimum redundancy method and incremental feature selection to select features to optimize the prediction model, and (3) being able to predict the translation rate of RNA into high or low translation rate category. The prediction accuracies under rich and starvation condition were 68.8% and 70.0%, respectively, evaluated by jackknife cross-validation. It was found that the following features were correlated with translation rate: codon usage frequency, some gene ontology enrichment scores, number of RNA binding proteins known to bind its mRNA product, coding sequence length, protein abundance and 5′UTR free energy. These findings might provide useful information for understanding the mechanisms of translation and dynamic proteome. Our translation rate prediction model might become a high throughput tool for annotating the translation rate of mRNAs in large-scale.

Highlights

  • It is often assumed that genes with high mRNA levels have high protein abundance

  • Our results suggest that the following features are correlated with translation rate: codon usage frequency, some gene ontology enrichment scores, biochemical and physicochemical features of protein, number of RNA binding proteins known to bind its mRNA product, coding sequence length, protein abundance and 59UTR free energy

  • In rigorous jackknife cross-validation test, the predictor can achieve an overall prediction accuracy of 68.8% and 70.0% in rich and starvation conditions, respectively

Read more

Summary

Introduction

It is often assumed that genes with high mRNA levels have high protein abundance. MRNA levels are used instead of protein abundance. Many studies either could not find the assumed correlation between mRNA level and protein abundance [1] or the correlation was very weak[2,3]. Only 20%–40% of protein abundance is determined by the concentration of its corresponding mRNA [4,5]. The reason for such weak correlation between protein and mRNA levels is that protein concentrations depend on the mRNA level, and the translation rate and the degradation rate [6]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call