A mixture framework for inferring ancestral gene orders

Yiwei Zhang,Jijun Tang,Fei Hu

doi:10.1186/1471-2164-13-s1-s7

Yiwei Zhang, Jijun Tang + Show 1 more

Open Access

https://doi.org/10.1186/1471-2164-13-s1-s7

Copy DOI

Abstract

BackgroundInferring gene orders of ancestral genomes has the potential to provide detailed information about the recent evolution of species descended from them. Current popular tools to infer ancestral genome data (such as GRAPPA and MGR) are all parsimony-based direct optimization methods with the aim to minimize the number of evolutionary events. Recently a new method based on the approach of maximum likelihood is proposed. The current implementation of these direct optimization methods are all based on solving the median problems and achieve more accurate results than the maximum likelihood method. However, both GRAPPA and MGR are extremely time consuming under high rearrangement rates. The maximum likelihood method, on the contrary, runs much faster with less accurate results.ResultsWe propose a mixture method to optimize the inference of ancestral gene orders. This method first uses the maximum likelihood approach to identify gene adjacencies that are likely to be present in the ancestral genomes, which are then fixed in the branch-and-bound search of median calculations. This hybrid approach not only greatly speeds up the direct optimization methods, but also retains high accuracy even when the genomes are evolutionary very distant.ConclusionsOur mixture method produces more accurate ancestral genomes compared with the maximum likelihood method while the computation time is far less than that of the parsimony-based direct optimization methods. It can effectively deal with genome data of relatively high rearrangement rates which is hard for the direct optimization methods to solve in a reasonable amount of time, thus extends the range of data that can be analyzed by the existing methods.

Highlights

Inferring gene orders of ancestral genomes has the potential to provide detailed information about the recent evolution of species descended from them
We propose a mixture method to enhance the inference of ancestral gene orders
We randomly select a number of gene adjacencies from the ancestral genome and fix them to run the median calculation

Summary

Introduction

Inferring gene orders of ancestral genomes has the potential to provide detailed information about the recent evolution of species descended from them. The current implementation of these direct optimization methods are all based on solving the median problems and achieve more accurate results than the maximum likelihood method. Both GRAPPA and MGR are extremely time consuming under high rearrangement rates. Hannenhalli and Penvzner [2] provided the first polynomial order data and the ancestral gene orders are reconstructed to maximize the overall probability This method is much faster than the direct optimization methods under high evolution rearrangement event rate, but the accuracy of the reconstructed gene order is lower than that of the direct optimization methods. For those datasets that are previously too difficult for existing methods, we will be able to analyze them within a reasonable time frame with very high accuracy

Methods

Results

Conclusion