Abstract

BackgroundThe order of genes in bacterial genomes is not random; for example, the products of genes belonging to an operon work together in the same pathway. The cotranslational assembly of protein complexes is deemed to conserve genomic neighborhoods even stronger than a common function. This is why a conserved genomic neighborhood can be utilized to predict, whether gene products form protein complexes.ResultsWe were interested to assess the performance of a neighborhood-based classifier that analyzes a large number of genomes. Thus, we determined for the genes encoding the subunits of 494 experimentally verified hetero-dimers their local genomic context. In order to generate phylogenetically comprehensive genomic neighborhoods, we utilized the tools offered by the Enzyme Function Initiative. For each subunit, a sequence similarity network was generated and the corresponding genome neighborhood network was analyzed to deduce the most frequent gene product. This was predicted as interaction partner, if its abundance exceeded a threshold, which was the frequency giving rise to the maximal Matthews correlation coefficient. For the threshold of 16%, the true positive rate was 45%, the false positive rate 0.06%, and the precision 55%. For approximately 20% of the subunits, the interaction partner was not found in a neighborhood of ± 10 genes.ConclusionsOur phylogenetically comprehensive analysis confirmed that complex formation is a strong evolutionary factor that conserves genome neighborhoods. On the other hand, for 55% of the cases analyzed here, classification failed. Either, the interaction partner was not present in a ± 10 gene window or was not the most frequent gene product.

Highlights

  • The order of genes in bacterial genomes is not random; for example, the products of genes belonging to an operon work together in the same pathway

  • An example is the trp operon of Escherichia coli that consists of the five genes trpA – trpE catalyzing tryptophan biosynthesis from chorismate [5]

  • Deducing phylogenetically diverse genomic neighborhoods of bona fide bacterial hetero-dimers We were interested to find out how reliable the genomic neighborhood indicates for a given subunit of a heteromeric complex the interaction partner, if we analyze its neighborhood in genomes from many phylogenetically diverse species

Read more

Summary

Introduction

The order of genes in bacterial genomes is not random; for example, the products of genes belonging to an operon work together in the same pathway. The cotranslational assembly of protein complexes is deemed to conserve genomic neighborhoods even stronger than a common function. This is why a conserved genomic neighborhood can be utilized to predict, whether gene products form protein complexes. An example is the trp operon of Escherichia coli that consists of the five genes trpA – trpE catalyzing tryptophan biosynthesis from chorismate [5].

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call