Abstract

BackgroundAmongst the most commonly used molecular markers for plant phylogenetic studies are the nuclear ribosomal internal transcribed spacers (ITS). Intra-individual variability of these multicopy regions is a very common phenomenon in plants, the causes of which are debated in literature. Phylogenetic reconstruction under these conditions is inherently difficult. Our approach is to consider this problem as a special case of the general biological question of how to infer the characteristics of hosts (represented here by plant individuals) from features of their associates (represented by cloned sequences here).ResultsSix general transformation functions are introduced, covering the transformation of associate characters to discrete and continuous host characters, and the transformation of associate distances to host distances. A pure distance-based framework is established in which these transformation functions are applied to ITS sequences collected from the angiosperm genera Acer, Fagus and Zelkova. The formulae are also applied to allelic data of three different loci obtained from Rosa spp. The functions are validated by (1) phylogeny-independent measures of treelikeness; (2) correlation with independent host characters; (3) visualization using splits graphs and comparison with published data on the test organisms. The results agree well with these three measures and the datasets examined as well as with the theoretical predictions and previous results in the literature. High-quality distance matrices are obtained with four of the six transformation formulae. We demonstrate that one of them represents a generalization of the Sørensen coefficient, which is widely applied in ecology.ConclusionBecause of their generality, the transformation functions may be applied to a wide range of biological problems that are interpretable in terms of hosts and associates. Regarding cloned sequences, the formulae have a high potential to accurately reflect evolutionary relationships within angiosperm genera, and to identify hybrids and ancestral taxa. These results corroborate earlier ones which showed that treelikeness measures are a valuable tool in comparative studies of biological distance functions.

Highlights

  • Amongst the most commonly used molecular markers for plant phylogenetic studies are the nuclear ribosomal internal transcribed spacers (ITS)

  • Distance quality and correlation with morphology Correlations of the transformed distances with distances derived from associate numbers (SIZ) distances, which were designed for detecting a potential bias related to sampling size, were statistically significant in some cases (SIZ distance matrices are provided in Additional file 1; full correlation matrices, including Spearman correlation values and permutational probabilities as calculated with CADM, are provided in Additional file 2)

  • In Acer, a significant correlation between "phylogenetic Bray-Curtis" distance transformation (PBC) and SIZ distances was observed if a sampling size threshold of 2 was applied and gaps were treated as missing character states; the correlation disappeared if the minimum number of associates required was incremented by 1

Read more

Summary

Introduction

Amongst the most commonly used molecular markers for plant phylogenetic studies are the nuclear ribosomal internal transcribed spacers (ITS). Our approach is to consider this problem as a special case of the general biological question of how to infer the characteristics of hosts (represented here by plant individuals) from features of their associates (represented by cloned sequences here) Multicopy gene regions such as the nuclear ribosomal DNA spacers may exhibit significant heterogeneity within the same individual. Grimm et al [31] inferred a more detailed molecular phylogenetic hypothesis than would have been possible by using the primary (untransformed) sequence data, supporting the phylogenetic scenario predicted earlier [23]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call