Abstract

Efflux proteins are membrane proteins, which are involved in the transportation of multidrugs. The annotation of efflux proteins in genomic sequences would aid to understand the function. Although the percentage of membrane proteins in genomes is estimated to be 25-30%, there is no information about the content of efflux proteins. For annotating such class of proteins it is necessary to develop a reliable method to identify efflux proteins from amino acid sequence information. In this work, we have developed a method based on radial basis function networks using position specific scoring matrices (PSSM) and amino acid properties. We noticed that the C-terminal domain of efflux proteins contain vital information for discrimination. Our method showed an accuracy of 78 and 92% in discriminating efflux proteins from transporters and membrane proteins, respectively using fivefold cross-validation. We utilized our method for annotating the genomes E. coli and P. aeruginosa and it predicted 8.7 and 9.2% of proteins as efflux proteins in these genomes, respectively. The predicted efflux proteins have been compared with available experimental data and we observed a very good agreement between them. Further, we developed a web server for classifying efflux proteins and it is freely available at http://rbf.bioinfo.tw/∼sachen/EFFLUXpredict/Efflux-RBF.php. We suggest that our method could be an effective tool for annotating efflux proteins in genomic sequences.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call