Power and Weakness of Repetition – Evaluating the Phylogenetic Signal From Repeatomes in the Family Rosaceae With Two Case Studies From Genera Prone to Polyploidy and Hybridization (Rosa and Fragaria)

Veit Herklotz,Aleš Kovařík,Radka Vozárová,Klaus Olbricht,Marco Groth,Christiane M Ritz,Jana Lunerová,Volker Wissemann,Sebastian Buschmann

doi:10.3389/fpls.2021.738119

Abstract

Plant genomes consist, to a considerable extent, of non-coding repetitive DNA. Several studies showed that phylogenetic signals can be extracted from such repeatome data by using among-species dissimilarities from the RepeatExplorer2 pipeline as distance measures. Here, we advanced this approach by adjusting the read input for comparative clustering indirectly proportional to genome size and by summarizing all clusters into a main distance matrix subjected to Neighbor Joining algorithms and Principal Coordinate Analyses. Thus, our multivariate statistical method works as a “repeatomic fingerprint,” and we proved its power and limitations by exemplarily applying it to the family Rosaceae at intrafamilial and, in the genera Fragaria and Rosa, at the intrageneric level. Since both taxa are prone to hybridization events, we wanted to show whether repeatome data are suitable to unravel the origin of natural and synthetic hybrids. In addition, we compared the results based on complete repeatomes with those from ribosomal DNA clusters only, because they represent one of the most widely used barcoding markers. Our results demonstrated that repeatome data contained a clear phylogenetic signal supporting the current subfamilial classification within Rosaceae. Accordingly, the well-accepted major evolutionary lineages within Fragaria were distinguished, and hybrids showed intermediate positions between parental species in data sets retrieved from both complete repeatomes and rDNA clusters. Within the taxonomically more complicated and particularly frequently hybridizing genus Rosa, we detected rather weak phylogenetic signals but surprisingly found a geographic pattern at a population scale. In sum, our method revealed promising results at larger taxonomic scales as well as within taxa with manageable levels of reticulation, but success remained rather taxon specific. Since repeatomes can be technically easy and comparably inexpensively retrieved even from samples of rather poor DNA quality, our phylogenomic method serves as a valuable alternative when high-quality genomes are unavailable, for example, in the case of old museum specimens.

Highlights

In most eukaryotic genomes, especially in higher plants, the majority of nuclear DNA consists of repetitive elements, which, in total, are referred to as repeatome (Biscotti et al, 2015)
In contrast to Dodsworth et al (2015) and Vitales et al (2020), who used RE input reads in direct proportion to the genome size in order to reflect the proportion of repeat abundance per genomes, we propose here to adjust the read input amount in indirect proportion to the genome size to overcome the biased self-interconnection in graph-based clustering for species with large genomes and high repeat abundance
Samples from 24 Rosaceae species across three subfamilies and 10 tribes were comparatively analyzed with RE, resulting in 352 clusters

Summary

Introduction

Especially in higher plants, the majority of nuclear DNA consists of repetitive elements, which, in total, are referred to as repeatome (Biscotti et al, 2015). Transposable elements (TEs) are highly variable, mostly dispersed throughout the genome (Biscotti et al, 2015; Bourque et al, 2018), and transferred and amplified by DNA (Class II TE) or via an intermediate RNA (Class I TE). Within these TE classes, an amazing variety of types can be classified, and the abundance of certain types differs highly between taxa (Leitch and Leitch, 2008; Biscotti et al, 2015; Wendel et al, 2016). There is significant evidence for the hypothesis that horizontal TE transfer is widespread (Gilbert and Feschotte, 2018; Wallau et al, 2018)

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Plant Science	Publication Date: Dec 7, 2021
Citations: 8	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Power and Weakness of Repetition – Evaluating the Phylogenetic Signal From Repeatomes in the Family Rosaceae With Two Case Studies From Genera Prone to Polyploidy and Hybridization (Rosa and Fragaria)

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Plant Science

Lead the way for us

Similar Papers

Data_Sheet_1.zip
-
-
--
07 Dec 2021
07 Dec 2021

Phylogenetically aligned component analysis
Michael L Collyer ... Dean C Adams
Methods in Ecology and Evolution | VOL. 12
Michael L Collyer, et. al.Michael L Collyer ... Dean C Adams
04 Nov 2020
Methods in Ecology and Evolution | VOL. 12

Species invasions and the phylogenetic signal in geographical range size
Alex L Pigot ... David W Redding
Global Ecology and Biogeography | VOL. 27
Alex L Pigot, et. al.Alex L Pigot ... David W Redding
01 Sep 2018
Global Ecology and Biogeography | VOL. 27

Weak phylogenetic signal in physiological traits of methane-oxidizing bacteria.
S Krause ... P M Van Bodegom
Journal of Evolutionary Biology | VOL. 27
S Krause, et. al.S Krause ... P M Van Bodegom
03 May 2014
Journal of Evolutionary Biology | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Power and Weakness of Repetition – Evaluating the Phylogenetic Signal From Repeatomes in the Family Rosaceae With Two Case Studies From Genera Prone to Polyploidy and Hybridization (Rosa and Fragaria)

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Plant Science