Abstract

Protein–RNA interactions are implicated in a number of physiological roles as well as diseases, with molecular mechanisms ranging from defects in RNA splicing, localization and translation to the formation of aggregates. Currently, ∼1400 human proteins have experimental evidence of RNA-binding activity. However, only ∼250 of these proteins currently have experimental data on their target RNAs from various sequencing-based methods such as eCLIP. To bridge this gap, we used an established, computationally expensive protein–RNA interaction prediction method, catRAPID, to populate a large database, RNAct. RNAct allows easy lookup of known and predicted interactions and enables global views of the human, mouse and yeast protein–RNA interactomes, expanding them in a genome-wide manner far beyond experimental data (http://rnact.crg.eu).

Highlights

  • RNA-binding proteins (RBPs) are key in RNA splicing, processing, export, localization and regulation of translation and are implicated in a number of pathologies in humans

  • Human proteins encoded by 1393 genes currently have experimental evidence of RNA-binding activity [4,5,6]

  • Normalizing the prediction score by sequence lengths, to a previous work [19], we found that the predictive performance decreases slightly

Read more

Summary

INTRODUCTION

RNA-binding proteins (RBPs) are key in RNA splicing, processing, export, localization and regulation of translation and are implicated in a number of pathologies in humans. Human proteins encoded by 1393 genes currently have experimental evidence of RNA-binding activity [4,5,6]. These proteins contain one or more RNA-binding regions, either in the form of canonical globular domains or of more recently discovered, intrinsically disordered RNA interaction regions [7,8]. Our database covers the H. sapiens, M. musculus and S. cerevisiae genomes and contains a total of 5.87 billion pairwise interactions. RNAct makes available our genome-wide protein–RNA interaction predictions and combines them with powerful and intuitive search functionality, including pairwise search for sets of proteins and RNAs. The display is enriched with useful annotation, including transcript support level (TSL) and APPRIS classification for isoforms and RNA subcellular localization from the RNALocate database.

MATERIALS AND METHODS
DISCUSSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.