Bidirectional best hit r-window gene clusters

Melvin Zhang,Hon Wai Leong

doi:10.1186/1471-2105-11-s1-s63

Abstract

BackgroundConserved gene clusters are groups of genes that are located close to one another in the genomes of several species. They tend to code for proteins that have a functional interaction. The identification of conserved gene clusters is an important step towards understanding genome evolution and predicting gene function.ResultsIn this paper, we propose a novel pairwise gene cluster model that combines the notion of bidirectional best hits with the r-window model introduced in 2003 by Durand and Sankoff. The bidirectional best hit (BBH) constraint removes the need to specify the minimum number of shared genes in the r-window model and improves the relevance of the results. We design a subquadratic time algorithm to compute the set of BBH r-window gene clusters efficiently.ConclusionWe apply our cluster model to the comparative analysis of E. coli K-12 and B. subtilis and perform an extensive comparison between our new model and the gene teams model developed by Bergeron et al. As compared to the gene teams model, our new cluster model has a slightly lower recall but a higher precision at all levels of recall when the results were ranked using statistical tests. An analysis of the most significant BBH r-window gene cluster show that they correspond to known operons.

Highlights

Conserved gene clusters are groups of genes that are located close to one another in the genomes of several species
We apply our cluster model to the comparative analysis of E. coli K-12 and B. subtilis and perform an extensive comparison between our new model and the gene teams model developed by Bergeron et al As compared to the gene teams model, our new cluster model has a slightly lower recall but a higher precision at all levels of recall when the results were ranked using statistical tests
We investigated the power of our bidirectional best hit (BBH) r-window model by applying it to the analysis of conserved gene clusters between E. coli K-12 and B. subtilis and comparing our results with that obtained by [8] based on the gene teams model [7]

Summary

Introduction

Conserved gene clusters are groups of genes that are located close to one another in the genomes of several species. The identification of conserved gene clusters is an important step towards understanding genome evolution and predicting gene function. It is well-known that the differences between the genomes of extant species can be attributed to both small and large-scale mutations [1]. Comparison of multiple genomes based on their gene orders – the sequence of genetic markers – reveal segments with homologous gene content. These segments are commonly referred to as conserved gene cluster. The most well studied examples are co-transcribed genes, known as (page number not for citation purposes)

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jan 1, 2010
Citations: 30	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Bidirectional best hit r-window gene clusters

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Density-based binning of gene clusters to infer function or evolutionary history using GeneGrouper.
Alexander G Mcfarland ... Nolan W Kennedy
Bioinformatics (Oxford, England) | VOL. 38
Alexander G Mcfarland, et. al.Alexander G Mcfarland ... Nolan W Kennedy
04 Nov 2021
Bioinformatics (Oxford, England) | VOL. 38

AtlasT4SS: A curated database for type IV secretion systems
Rangel C Souza ... Guadalupe Del Rosario Quispe Saji
BMC Microbiology | VOL. 12
Rangel C Souza, et. al.Rangel C Souza ... Guadalupe Del Rosario Quispe Saji
09 Aug 2012
BMC Microbiology | VOL. 12

Phylogenetic detection of conserved gene clusters in microbial genomes
Yu Zheng ... Richard J Roberts
BMC Bioinformatics | VOL. 6
Yu Zheng, et. al.Yu Zheng ... Richard J Roberts
03 Oct 2005
BMC Bioinformatics | VOL. 6

Algorithms for Computing Bidirectional Best Hit r-Window Gene Clusters
Trong Dao Le ... Hon Wai Leong
-
Trong Dao Le, et. al.Trong Dao Le ... Hon Wai Leong
01 Jan 2010
01 Jan 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bidirectional best hit r-window gene clusters

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics