Abstract

BackgroundRNA-binding proteins (RBPs) interact with their cognate RNA(s) to form large biomolecular assemblies. They are versatile in their functionality and are involved in a myriad of processes inside the cell. RBPs with similar structural features and common biological functions are grouped together into families and superfamilies. It will be useful to obtain an early understanding and association of RNA-binding property of sequences of gene products. Here, we report a web server, RStrucFam, to predict the structure, type of cognate RNA(s) and function(s) of proteins, where possible, from mere sequence information.ResultsThe web server employs Hidden Markov Model scan (hmmscan) to enable association to a back-end database of structural and sequence families. The database (HMMRBP) comprises of 437 HMMs of RBP families of known structure that have been generated using structure-based sequence alignments and 746 sequence-centric RBP family HMMs. The input protein sequence is associated with structural or sequence domain families, if structure or sequence signatures exist. In case of association of the protein with a family of known structures, output features like, multiple structure-based sequence alignment (MSSA) of the query with all others members of that family is provided. Further, cognate RNA partner(s) for that protein, Gene Ontology (GO) annotations, if any and a homology model of the protein can be obtained. The users can also browse through the database for details pertaining to each family, protein or RNA and their related information based on keyword search or RNA motif search.ConclusionsRStrucFam is a web server that exploits structurally conserved features of RBPs, derived from known family members and imprinted in mathematical profiles, to predict putative RBPs from sequence information. Proteins that fail to associate with such structure-centric families are further queried against the sequence-centric RBP family HMMs in the HMMRBP database. Further, all other essential information pertaining to an RBP, like overall function annotations, are provided. The web server can be accessed at the following link: http://caps.ncbs.res.in/rstrucfam.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-016-1289-x) contains supplementary material, which is available to authorized users.

Highlights

  • RNA-binding proteins (RBPs) interact with their cognate RNA(s) to form large biomolecular assemblies

  • All the essential information pertaining to RBPs can be obtained

  • Hidden Markov Models of RBP families (HMMRBP) database The database component of RStrucFam, called HMMRBP, holds all the information pertaining to the families that define the search space for the web server

Read more

Summary

Results

Associations to a structural family provides output features like MSSA of the query with all others members of that family, putative cognate RNAs for that protein, GO annotations, if any and a homology model of the protein. The assignment of a protein to an existing structural family helps to predict the putative RNA partner(s) and functions of the protein, based on the observation that members of the same structural family bind to similar RNAs (Additional file 1) and perform similar functions. A randomly selected subset of proteins from our initial dataset were queried against the HMM libraries of structural families Such resubstitution tests showed that 93 % of the proteins could be associated with their parent families (Additional file 3). Such a high sequence identity spread may arise due pure sequence dispersion or occasionally due to the presence of unknown (UNK) residues in the PDBs constituting a family

Conclusions
Background
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call