Abstract

Phylogenetic studies aim to discover evolutionary relationships and histories. These studies are based on similarities of morphological characters and molecular sequences. Currently, widely accepted phylogenetic approaches are based on multiple sequence alignments, which analyze shared gene datasets and concatenate/coalesce these results to a final phylogeny with maximum support. However, these approaches still have limitations, and often have conflicting results with each other. Reconstructing ancestral genomes helps us understand mechanisms and corresponding consequences of evolution. Most existing genome level phylogeny and ancestor reconstruction methods can only process simplified real genome datasets or simulated datasets with identical genome content, unique genome markers, and limited types of evolutionary events. Here, we provide an alternative way to resolve phylogenetic problems based on analyses of real genome data. We use phylogenetic signals from all types of genome level evolutionary events, and overcome the conflicting issues existing in traditional phylogenetic approaches. Further, we build an automated computational pipeline to reconstruct phylogenies and ancestral genomes for two high-resolution real yeast genome datasets. Comparison results with recent studies and publications show that we reconstruct very accurate and robust phylogenies and ancestors. Finally, we identify and analyze the conserved syntenic blocks among reconstructed ancestral genomes and present yeast species.

Highlights

  • Phylogenetic studies used to be the domain of morphological area, and were based on outward appearances and internal structures[1]

  • We first used our improved approach MLWD to construct the phylogeny for the first yeast genomes dataset with 11 species, which are available in Yeast Genome Order Browser (YGOB) (Version 3 April 2009)

  • We provide an independent and alternative way to build the phylogenies for real genome datasets, and eliminate the conflicting issues in traditional multiple sequence alignment based approaches

Read more

Summary

Introduction

Phylogenetic studies used to be the domain of morphological area, and were based on outward appearances and internal structures[1]. The widely accepted phylogenetic approach to alleviate these conflicting issues is to analyze shared gene datasets, and concatenate/coalesce their results from multiple sequence alignments to obtain a final phylogeny with the maximum support[4,5,6]. Gene order based phylogeny reconstruction approaches obtain phylogenetic signals from genome level evolutionary events, and can bypass the troublesome multiple sequence alignment step in traditional methods[19,20,22]. For current computational ancestral reconstruction methods, only ANGES38, Gapped Adjacency[26], and MGRA216 are reported to be capable of handling non-identical genome content and all types of evolutionary events. They still suffer from the issues of low-resolution, accuracy, and robustness. Vakirlis et al sequenced 10 non-WGD species from the Lachancea genus and developed software AnChro to reconstruct their ancestral genomes, which contained 4446 to 4799 genes[5]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call