Screening synteny blocks in pairwise genome comparisons through integer programming

Haibao Tang,Michael Freeling,James C Schnable,Andrew H Paterson,Eric Lyons,Brent Pedersen

doi:10.1186/1471-2105-12-102

Haibao Tang, Michael Freeling + Show 4 more

Open Access

https://doi.org/10.1186/1471-2105-12-102

Copy DOI

Abstract

BackgroundIt is difficult to accurately interpret chromosomal correspondences such as true orthology and paralogy due to significant divergence of genomes from a common ancestor. Analyses are particularly problematic among lineages that have repeatedly experienced whole genome duplication (WGD) events. To compare multiple "subgenomes" derived from genome duplications, we need to relax the traditional requirements of "one-to-one" syntenic matchings of genomic regions in order to reflect "one-to-many" or more generally "many-to-many" matchings. However this relaxation may result in the identification of synteny blocks that are derived from ancient shared WGDs that are not of interest. For many downstream analyses, we need to eliminate weak, low scoring alignments from pairwise genome comparisons. Our goal is to objectively select subset of synteny blocks whose total scores are maximized while respecting the duplication history of the genomes in comparison. We call this "quota-based" screening of synteny blocks in order to appropriately fill a quota of syntenic relationships within one genome or between two genomes having WGD events.ResultsWe have formulated the synteny block screening as an optimization problem known as "Binary Integer Programming" (BIP), which is solved using existing linear programming solvers. The computer program QUOTA-ALIGN performs this task by creating a clear objective function that maximizes the compatible set of synteny blocks under given constraints on overlaps and depths (corresponding to the duplication history in respective genomes). Such a procedure is useful for any pairwise synteny alignments, but is most useful in lineages affected by multiple WGDs, like plants or fish lineages. For example, there should be a 1:2 ploidy relationship between genome A and B if genome B had an independent WGD subsequent to the divergence of the two genomes. We show through simulations and real examples using plant genomes in the rosid superorder that the quota-based screening can eliminate ambiguous synteny blocks and focus on specific genomic evolutionary events, like the divergence of lineages (in cross-species comparisons) and the most recent WGD (in self comparisons).ConclusionsThe QUOTA-ALIGN algorithm screens a set of synteny blocks to retain only those compatible with a user specified ploidy relationship between two genomes. These blocks, in turn, may be used for additional downstream analyses such as identifying true orthologous regions in interspecific comparisons. There are two major contributions of QUOTA-ALIGN: 1) reducing the block screening task to a BIP problem, which is novel; 2) providing an efficient software pipeline starting from all-against-all BLAST to the screened synteny blocks with dot plot visualizations. Python codes and full documentations are publicly available http://github.com/tanghaibao/quota-alignment. QUOTA-ALIGN program is also integrated as a major component in SynMap http://genomevolution.com/CoGe/SynMap.pl, offering easier access to thousands of genomes for non-programmers.

Highlights

It is difficult to accurately interpret chromosomal correspondences such as true orthology and paralogy due to significant divergence of genomes from a common ancestor
A typical pipeline for genome structure comparison starts with the enumeration of “synteny blocks” regions of chromosomes between two or more input genomes that shared a common order of homologous genes and are inferred to be derived from a common ancestor
The algorithm we present here, called QUOTAALIGN, is a method that screens synteny blocks based on the expected number of subgenomes, effectively eliminating more ancient or spurious alignments

Summary

Introduction

It is difficult to accurately interpret chromosomal correspondences such as true orthology and paralogy due to significant divergence of genomes from a common ancestor. To compare multiple “subgenomes” derived from genome duplications, we need to relax the traditional requirements of “one-to-one” syntenic matchings of genomic regions in order to reflect “one-to-many” or more generally “many-to-many” matchings. This relaxation may result in the identification of synteny blocks that are derived from ancient shared WGDs that are not of interest. Our goal is to objectively select subset of synteny blocks whose total scores are maximized while respecting the duplication history of the genomes in comparison We call this “quota-based” screening of synteny blocks in order to appropriately fill a quota of syntenic relationships within one genome or between two genomes having WGD events. Synteny blocks are often viewed as “diagonals” on a syntenic dot plot, where dots represent putative homologous gene pairs or marker pairs as inferred by sequence similarity (Figure 1)

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Apr 18, 2011
Citations: 165	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Screening synteny blocks in pairwise genome comparisons through integer programming

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Contrasting patterns of evolution following whole genome versus tandem duplication events inPopulus
Eli Rodgers-Melnick ... Gancho T Slavov
Genome Research | VOL. 22
Eli Rodgers-Melnick, et. al.Eli Rodgers-Melnick ... Gancho T Slavov
05 Oct 2011
Genome Research | VOL. 22

Linked by Ancestral Bonds: Multiple Whole-Genome Duplications and Reticulate Evolution in a Brassicaceae Tribe.
Xinyi Guo ... Martin A Lysak
Molecular biology and evolution | VOL. 38
Xinyi Guo, et. al.Xinyi Guo ... Martin A Lysak
17 Dec 2020
Molecular biology and evolution | VOL. 38

Evolutionary Dynamics and Functional Specialization of Plant Paralogs Formed by Whole and Small-Scale Genome Duplications
Lorenzo Carretero-Paulet ... Mario A Fares
Molecular Biology and Evolution | VOL. 29
Lorenzo Carretero-Paulet, et. al.Lorenzo Carretero-Paulet ... Mario A Fares
13 Jul 2012
Molecular Biology and Evolution | VOL. 29

Lycophyte transcriptomes reveal two whole-genome duplications in Lycopodiaceae: Insights into the polyploidization of Phlegmariurus
Zeng-Qiang Xia ... Yue-Hong Yan
Plant Diversity | VOL. 44
Zeng-Qiang Xia, et. al.Zeng-Qiang Xia ... Yue-Hong Yan
27 Aug 2021
Plant Diversity | VOL. 44

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Screening synteny blocks in pairwise genome comparisons through integer programming

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics