Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling

Michal Brylinski

doi:10.1186/1477-5956-11-47

Abstract

BackgroundA growing body of evidence shows that gene products encoded by short open reading frames play key roles in numerous cellular processes. Yet, they are generally overlooked in genome assembly, escaping annotation because small protein-coding genes are difficult to predict computationally. Consequently, there are still a considerable number of small proteins whose functions are yet to be characterized.ResultsTo address this issue, we apply a collection of structural bioinformatics algorithms to infer molecular function of putative small proteins from the mouse proteome. Specifically, we construct 1,743 confident structure models of small proteins, which reveal a significant structural diversity with a noticeably high helical content. A subsequent structure-based function annotation of small protein models exposes 178,745 putative protein-protein interactions with the remaining gene products in the mouse proteome, 1,100 potential binding sites for small organic molecules and 987 metal-binding signatures.ConclusionsThese results strongly indicate that many small proteins adopt three-dimensional structures and are fully functional, playing important roles in transcriptional regulation, cell signaling and metabolism. Data collected through this work is freely available to the academic community at http://www.brylinski.org/content/databases to support future studies oriented on elucidating the functions of hypothetical small proteins.

Highlights

A growing body of evidence shows that gene products encoded by short open reading frames play key roles in numerous cellular processes
The development of generation sequencing (NGS) enables researchers to reach into almost complete genomes of numerous species [2,3], revealing more and more details on individual organisms functioning as systems
In this study, we apply a collection of tools for evolution/ structure-based function annotation of small proteins identified in the mouse proteome

Summary

Introduction

A growing body of evidence shows that gene products encoded by short open reading frames play key roles in numerous cellular processes. They are generally overlooked in genome assembly, escaping annotation because small protein-coding genes are difficult to predict computationally. Difficulties of de novo NGS assembly arise from e.g. contaminating sequences [4], low-quality reads [5], segmental duplications and large common repeats [6]. Contaminating sequences [4], low-quality reads [5], segmental duplications and large common repeats [6] Another salient flaw is a short-length discontinuity, which has been noted for several assembled genomes [7,8]. Several highlighted biological functions include engaging in regulatory processes [14], interacting with a lipid membrane [15] or even modulating its features, acting as chaperones of nucleic acids and metals [16], and stabilizing the structures of larger protein assemblies [17]

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proteome Science	Publication Date: Jan 1, 2013
Citations: 77	License type: cc-by

R Discovery Prime

R Discovery Prime

Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proteome Science

Lead the way for us

Similar Papers

Setting up a Meta-Threading Pipeline for High-Throughput Structural Bioinformatics: eThread Software Distribution, Walkthrough and Resource Profiling

Journal of Computer Science & Systems Biology | VOL. 06

01 Jan 2013
Journal of Computer Science & Systems Biology | VOL. 06

Exploring Human Diseases and Biological Mechanisms by Protein Structure Prediction and Modeling.
Juexin Wang ... Zheng Wang
Advances in experimental medicine and biology | VOL. 939
Juexin Wang, et. al.Juexin Wang ... Zheng Wang
01 Jan 2015
Advances in experimental medicine and biology | VOL. 939

Searching in microbial genomes for encoded small proteins
Jos Boekhorst ... Greer Wilson
Microbial Biotechnology | VOL. 4
Jos Boekhorst, et. al.Jos Boekhorst ... Greer Wilson
25 Apr 2011
Microbial Biotechnology | VOL. 4

Diversity of Translation Start Sites May Define Increased Complexity of the Human Short ORFeome
Masaaki Oyama ... Sumio Sugano
Molecular & Cellular Proteomics | VOL. 6
Masaaki Oyama, et. al.Masaaki Oyama ... Sumio Sugano
01 Jun 2007
Molecular & Cellular Proteomics | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring the “dark matter” of a mammalian proteome by protein structure and function modeling

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proteome Science