Abstract

Ancient retroposon insertions can be used as virtually homoplasy-free markers to reconstruct the phylogenetic history of species. Inherited, orthologous insertions in related species offer reliable signals of a common origin of the given species. One prerequisite for such a phylogenetically informative insertion is that the inserted element was fixed in the ancestral population before speciation; if not, polymorphically inserted elements may lead to random distributions of presence/absence states during speciation and possibly to apparently conflicting reconstructions of their ancestry. Fortunately, such misleading fixed cases are relatively rare but nevertheless, need to be considered. Here, we present novel, comprehensive statistical models applicable for (1) analyzing any pattern of rare genomic changes, (2) testing and differentiating conflicting phylogenetic reconstructions based on rare genomic changes caused by incomplete lineage sorting or/and ancestral hybridization, and (3) differentiating between search strategies involving genome information from one or several lineages. When the new statistics are applied, in non-conflicting cases a minimum of three elements present in both of two species and absent in a third group are considered significant support (p<0.05) for the branching of the third from the other two, if all three of the given species are screened equally for genome or experimental data. Five elements are necessary for significant support (p<0.05) if a diagnostic locus derived from only one of three species is screened, and no conflicting markers are detected. Most potentially conflicting patterns can be evaluated for their significance and ancestral hybridization can be distinguished from incomplete lineage sorting by considering symmetric or asymmetric distribution of rare genomic changes among possible tree configurations. Additionally, we provide an R-application to make the new KKSC insertion significance test available for the scientific community at http://retrogenomics.uni-muenster.de:3838/KKSC_significance_test/.

Highlights

  • In their pioneering work, Ryan and Dugaiczyk [1] first proposed using Short INterspersed Element (SINE) insertions as phylogenetic markers with the suggestion: “we submit that the chronology of divergence of primate lines of evolution can be correlated with the timing of insertion of new DNA repeats into the genomes of those primates”

  • We have provided a comprehensive statistical framework for testing the significance of support for phylogenetic hypotheses derived from genome-level presence/absence data such as retroposon insertions and for evaluating such data for different evolutionary scenarios, including polytomy, incomplete lineage sorting, and ancestral hybridization

  • This statistical framework is especially important for high-throughput applications of current and upcoming genome projects due to its treatment of unlimited numbers of testable markers, and is embedded in a user-friendly R-application available to the scientific community online

Read more

Summary

Introduction

Ryan and Dugaiczyk [1] first proposed using Short INterspersed Element (SINE) insertions as phylogenetic markers with the suggestion: “we submit that the chronology of divergence of primate lines of evolution can be correlated with the timing of insertion of new DNA repeats into the genomes of those primates”. Their originally detected insertions were of no direct phylogenetic relevance, subsequent studies fostered this innovative idea, and systematically searched for retroposon insertions as genomic landmarks of phylogeny A synergistic application of both marker systems is the most efficient way to extract historical information from species

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call