Concordance and Discordance in the Phylogenomics of the Wrasses and Parrotfishes (Teleostei: Labridae).

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon

Phylogenomic analysis of large genome-wide sequence data sets can resolve phylogenetic tree topologies for large species groups, help test the accuracy of and improve resolution for earlier multi-locus studies and reveal the level of agreement or concordance within partitions of the genome for various tree topologies. Here we used a target-capture approach to sequence 1088 single-copy exons for more than 200 labrid fishes together with more than 100 outgroup taxa to generate a new data-rich phylogeny for the family Labridae. Our time-calibrated phylogenetic analysis of exon-capture data pushes the root node age of the family Labridae back into the Cretaceous to about 79 Ma years ago. The monotypic Centrogenys vaigiensis, and the order Uranoscopiformes (stargazers) are identified as the sister lineages of Labridae. The phylogenetic relationships among major labrid subfamilies and within these clades were largely congruent with prior analyses of select mitochondrial and nuclear datasets. However, the position of the tribe Cirrhilabrini (fairy and flame wrasses) showed discordance, resolving either as the sister to a crown julidine clade or alternatively sister to a group formed by the labrines, cheilines and scarines. Exploration of this pattern using multiple approaches leads to slightly higher support for this latter hypothesis, highlighting the importance of genome-level data sets for resolving short internodes at key phylogenetic positions in a large, economically important groups of coral reef fishes. More broadly, we demonstrate how accounting for sources of biological variability from incomplete lineage sorting and exploring systematic error at conflicting nodes can aid in evaluating alternative phylogenetic hypotheses. [coral reefs; divergence time estimation; exon-capture; fossil calibration; incomplete lineage sorting.].

Similar Papers
  • Research Article
  • Cite Count Icon 172
  • 10.1016/j.ympev.2014.11.003
Deep phylogenetic incongruence in the angiosperm clade Rosidae
  • Nov 18, 2014
  • Molecular Phylogenetics and Evolution
  • Miao Sun + 5 more

Deep phylogenetic incongruence in the angiosperm clade Rosidae

  • Research Article
  • Cite Count Icon 17
  • 10.1038/sj.hdy.6800644
Molecular clocks: Closing the gap between rocks and clocks
  • Feb 16, 2005
  • Heredity
  • K Cranston + 1 more

Molecular clocks: Closing the gap between rocks and clocks

  • Research Article
  • Cite Count Icon 84
  • 10.1093/sysbio/syu109
Characterization of the uncertainty of divergence time estimation under relaxed molecular clock models using multiple loci.
  • Dec 11, 2014
  • Systematic Biology
  • Tianqi Zhu + 2 more

Genetic sequence data provide information about the distances between species or branch lengths in a phylogeny, but not about the absolute divergence times or the evolutionary rates directly. Bayesian methods for dating species divergences estimate times and rates by assigning priors on them. In particular, the prior on times (node ages on the phylogeny) incorporates information in the fossil record to calibrate the molecular tree. Because times and rates are confounded, our posterior time estimates will not approach point values even if an infinite amount of sequence data are used in the analysis. In a previous study we developed a finite-sites theory to characterize the uncertainty in Bayesian divergence time estimation in analysis of large but finite sequence data sets under a strict molecular clock. As most modern clock dating analyses use more than one locus and are conducted under relaxed clock models, here we extend the theory to the case of relaxed clock analysis of data from multiple loci (site partitions). Uncertainty in posterior time estimates is partitioned into three sources: Sampling errors in the estimates of branch lengths in the tree for each locus due to limited sequence length, variation of substitution rates among lineages and among loci, and uncertainty in fossil calibrations. Using a simple but analogous estimation problem involving the multivariate normal distribution, we predict that as the number of loci () goes to infinity, the variance in posterior time estimates decreases and approaches the infinite-data limit at the rate of 1/, and the limit is independent of the number of sites in the sequence alignment. We then confirmed the predictions by using computer simulation on phylogenies of two or three species, and by analyzing a real genomic data set for six primate species. Our results suggest that with the fossil calibrations fixed, analyzing multiple loci or site partitions is the most effective way for improving the precision of posterior time estimation. However, even if a huge amount of sequence data is analyzed, considerable uncertainty will persist in time estimates.

  • Research Article
  • Cite Count Icon 59
  • 10.1093/sysbio/syu082
A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction
  • Oct 3, 2014
  • Systematic Biology
  • Leonardo De Oliveira Martins + 2 more

Current phylogenomic data sets highlight the need for species tree methods able to deal with several sources of gene tree/species tree incongruence. At the same time, we need to make most use of all available data. Most species tree methods deal with single processes of phylogenetic discordance, namely, gene duplication and loss, incomplete lineage sorting (ILS) or horizontal gene transfer. In this manuscript, we address the problem of species tree inference from multilocus, genome-wide data sets regardless of the presence of gene duplication and loss and ILS therefore without the need to identify orthologs or to use a single individual per species. We do this by extending the idea of Maximum Likelihood (ML) supertrees to a hierarchical Bayesian model where several sources of gene tree/species tree disagreement can be accounted for in a modular manner. We implemented this model in a computer program called guenomu whose inputs are posterior distributions of unrooted gene tree topologies for multiple gene families, and whose output is the posterior distribution of rooted species tree topologies. We conducted extensive simulations to evaluate the performance of our approach in comparison with other species tree approaches able to deal with more than one leaf from the same species. Our method ranked best under simulated data sets, in spite of ignoring branch lengths, and performed well on empirical data, as well as being fast enough to analyze relatively large data sets. Our Bayesian supertree method was also very successful in obtaining better estimates of gene trees, by reducing the uncertainty in their distributions. In addition, our results show that under complex simulation scenarios, gene tree parsimony is also a competitive approach once we consider its speed, in contrast to more sophisticated models.

  • Research Article
  • Cite Count Icon 37
  • 10.1111/j.1365-294x.2011.05008.x
Mitochondrial and nuclear DNA sequences reveal recent divergence in morphologically indistinguishable petrels
  • Feb 16, 2011
  • Molecular Ecology
  • Andreanna J Welch + 2 more

Often during the process of divergence, genetic markers will only gradually obtain the signal of isolation. Studies of recently diverged taxa utilizing both mitochondrial and nuclear data sets may therefore yield gene trees with differing levels of phylogenetic signal as a result of differences in coalescence times. However, several factors can lead to this same pattern, and it is important to distinguish between them to gain a better understanding of the process of divergence and the factors driving it. Here, we employ three nuclear intron loci in addition to the mitochondrial Cytochrome b gene to investigate the magnitude and timing of divergence between two endangered and nearly indistinguishable petrel taxa: the Galapagos (GAPE) and Hawaiian (HAPE) petrels (Pterodroma phaeopygia and P. sandwichensis). Phylogenetic analyses indicated reciprocal monophyly between these two taxa for the mitochondrial data set, but trees derived from the nuclear introns were unresolved. Coalescent analyses revealed effectively no migration between GAPE and HAPE over the last 100,000 generations and that they diverged relatively recently, approximately 550,000 years ago, coincident with a time of intense ecological change in both the Galapagos and Hawaiian archipelagoes. This indicates that recent divergence and incomplete lineage sorting are causing the difference in the strength of the phylogenetic signal of each data set, instead of insufficient variability or ongoing male-biased dispersal. Further coalescent analyses show that gene flow is low even between islands within each archipelago suggesting that divergence may be continuing at a local scale. Accurately identifying recently isolated taxa is becoming increasingly important as many clearly recognizable species are already threatened by extinction.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 26
  • 10.1371/journal.pone.0174712
Phylogeographic patterns of Lygus pratensis (Hemiptera: Miridae): Evidence for weak genetic structure and recent expansion in northwest China.
  • Apr 3, 2017
  • PLOS ONE
  • Li-Juan Zhang + 8 more

Lygus pratensis (L.) is an important cotton pest in China, especially in the northwest region. Nymphs and adults cause serious quality and yield losses. However, the genetic structure and geographic distribution of L. pratensis is not well known. We analyzed genetic diversity, geographical structure, gene flow, and population dynamics of L. pratensis in northwest China using mitochondrial and nuclear sequence datasets to study phylogeographical patterns and demographic history. L. pratensis (n = 286) were collected at sites across an area spanning 2,180,000 km2, including the Xinjiang and Gansu-Ningxia regions. Populations in the two regions could be distinguished based on mitochondrial criteria but the overall genetic structure was weak. The nuclear dataset revealed a lack of diagnostic genetic structure across sample areas. Phylogenetic analysis indicated a lack of population level monophyly that may have been caused by incomplete lineage sorting. The Mantel test showed a significant correlation between genetic and geographic distances among the populations based on the mtDNA data. However the nuclear dataset did not show significant correlation. A high level of gene flow among populations was indicated by migration analysis; human activities may have also facilitated insect movement. The availability of irrigation water and ample cotton hosts makes the Xinjiang region well suited for L. pratensis reproduction. Bayesian skyline plot analysis, star-shaped network, and neutrality tests all indicated that L. pratensis has experienced recent population expansion. Climatic changes and extensive areas occupied by host plants have led to population expansion of L. pratensis. In conclusion, the present distribution and phylogeographic pattern of L. pratensis was influenced by climate, human activities, and availability of plant hosts.

  • Research Article
  • Cite Count Icon 40
  • 10.1016/j.ympev.2019.106646
Estimating uncertainty in divergence times among three-spined stickleback clades using the multispecies coalescent
  • Oct 18, 2019
  • Molecular Phylogenetics and Evolution
  • Bohao Fang + 3 more

Incomplete lineage sorting (ILS) can lead to biased divergence time estimates. To explore if and how ILS has influenced the results of a recent worldwide phylogeny of three-spined sticklebacks (Gasterosteus aculeatus), we estimated divergence times among major clades by applying both a concatenation approach and the multispecies coalescent (MSC) model to single-nucleotide polymorphisms. To further test the influence of different calibration strategies, we applied different calibrations to the root and to younger nodes in addition to the ones used in the previous study. Both the updated calibrations and the application of the MSC model influenced divergence time estimates, sometimes significantly. The new divergence time estimates were more ancient than in the previous study for older nodes, whereas the estimates of younger nodes were not strongly affected by the re-analyses. However, since the applied MSC method employs a simple substitution model and cannot account for changes in population size, we suggest that different analytical approaches and calibration strategies should be used in order to explore uncertainty in divergence time estimates. This study provides a valuable reference timeline for the ages of worldwide three-spined stickleback populations and emphasizes the need to embrace, rather than obscure, uncertainties around divergence time estimates.

  • Research Article
  • Cite Count Icon 71
  • 10.1093/sysbio/syv070
Rumbling Orchids: How To Assess Divergent Evolution Between Chloroplast Endosymbionts and the Nuclear Host.
  • Oct 1, 2015
  • Systematic Biology
  • Oscar Alejandro Pérez-Escobar + 2 more

Phylogenetic relationships inferred from multilocus organellar and nuclear DNA data are often difficult to resolve because of evolutionary conflicts among gene trees. However, conflicting or "outlier" associations (i.e., linked pairs of "operational terminal units" in two phylogenies) among these data sets often provide valuable information on evolutionary processes such as chloroplast capture following hybridization, incomplete lineage sorting, and horizontal gene transfer. Statistical tools that to date have been used in cophylogenetic studies only also have the potential to test for the degree of topological congruence between organellar and nuclear data sets and reliably detect outlier associations. Two distance-based methods, namely ParaFit and Procrustean Approach to Cophylogeny (PACo), were used in conjunction to detect those outliers contributing to conflicting phylogenies independently derived from chloroplast and nuclear sequence data. We explored their efficiency of retrieving outlier associations, and the impact of input data (unit branch length and additive trees) between data sets, by using several simulation approaches. To test their performance using real data sets, we additionally inferred the phylogenetic relationships within Neotropical Catasetinae (Epidendroideae, Orchidaceae), which is a suitable group to investigate phylogenetic incongruence because of hybridization processes between some of its constituent species. A comparison between trees derived from chloroplast and nuclear sequence data reflected strong, well-supported incongruence within Catasetum, Cycnoches, and Mormodes. As a result, outliers among chloroplast and nuclear data sets, and in experimental simulations, were successfully detected by PACo when using patristic distance matrices obtained from phylograms, but not from unit branch length trees. The performance of ParaFit was overall inferior compared to PACo, using either phylograms or unit branch lengths as input data. Because workflows for applying cophylogenetic analyses are not standardized yet, we provide a pipeline for executing PACo and ParaFit as well as displaying outlier associations in plots and trees by using the software R. The pipeline renders a method to identify outliers with high reliability and to assess the combinability of the independently derived data sets by means of statistical analyses.

  • Research Article
  • Cite Count Icon 204
  • 10.1093/sysbio/syw018
Gene Tree Discordance Causes Apparent Substitution Rate Variation.
  • Feb 28, 2016
  • Systematic Biology
  • Fábio K Mendes + 1 more

Substitution rates are known to be variable among genes, chromosomes, species, and lineages due to multifarious biological processes. Here, we consider another source of substitution rate variation due to a technical bias associated with gene tree discordance. Discordance has been found to be rampant in genome-wide data sets, often due to incomplete lineage sorting (ILS). This apparent substitution rate variation is caused when substitutions that occur on discordant gene trees are analyzed in the context of a single, fixed species tree. Such substitutions have to be resolved by proposing multiple substitutions on the species tree, and we therefore refer to this phenomenon as Substitutions Produced by ILS (SPILS). We use simulations to demonstrate that SPILS has a larger effect with increasing levels of ILS, and on trees with larger numbers of taxa. Specific branches of the species trees are consistently, but erroneously, inferred to be longer or shorter, and we show that these branches can be predicted based on discordant tree topologies. Moreover, we observe that fixing a species tree topology when performing tests of positive selection increases the false positive rate, particularly for genes whose discordant topologies are most affected by SPILS. Finally, we use data from multiple Drosophila species to show that SPILS can be detected in nature. Although the effects of SPILS are modest per gene, it has the potential to affect substitution rate variation whenever high levels of ILS are present, particularly in rapid radiations. The problems outlined here have implications for character mapping of any type of trait, and for any biological process that causes discordance. We discuss possible solutions to these problems, and areas in which they are likely to have caused faulty inferences of convergence and accelerated evolution.

  • Research Article
  • Cite Count Icon 184
  • 10.1093/molbev/msu186
Bears in a Forest of Gene Trees: Phylogenetic Inference Is Complicated by Incomplete Lineage Sorting and Gene Flow
  • Jun 5, 2014
  • Molecular Biology and Evolution
  • Verena E Kutschera + 5 more

Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal.

  • Research Article
  • Cite Count Icon 23
  • 10.1043/02-69.1
Molecular Evidence for Definition of Genera in the Oxylobium Group (Fabaceae: Mirbelieae)
  • Jan 24, 2009
  • Systematic Botany
  • Michael D Crisp + 1 more

The circumscription of Oxylobium and related genera has been problematic for nearly 200 years. Traditional definitions of genera in the group have relied on morphological features of the leaves, flower, and fruit that overlap extensively between genera. Therefore sequences of cpDNA (trnL-F intron and spacer) and nrDNA (ITS) were used to estimate the phylogeny of the group in an attempt to redefine the genera as monophyletic groups. Oxylobium sens. str. was found to be a well supported clade in both data sets, with the inclusion of Mirbelia oxylobioides. No other genus in the group was supported by these data, except Gastrolobium sens. lat. Some species groups within Chorizema, Mirbelia, and Podolobium were supported but relationships among these, Oxylobium and Gastrolobium differed significantly between the chloroplast and nuclear data sets. No group supported by the molecular data had a morphological synapomorphy, not even Oxylobium or Gastrolobium. Therefore it may be necessary to adopt a much broader generic concept in this group than has been done previously. Incongruence between the two molecular data sets, and very short internal basal branches in both, suggest a rapid early radiation in this group, possibly combined with hybridization and lineage sorting.

  • Research Article
  • Cite Count Icon 23
  • 10.1071/sb18057
Genome skimming provides well resolved plastid and nuclear phylogenies, showing patterns of deep reticulate evolution in the tropical carnivorous plant genus Nepenthes (Caryophyllales)
  • Jun 12, 2019
  • Australian Systematic Botany
  • Lars Nauheimer + 5 more

Nepenthes is a genus of carnivorous plants consisting of ~160 species that are distributed in the paleotropics. Molecular systematics has so far not been able to resolve evolutionary relationships of most species because of the limited genetic divergence in previous studies. In the present study, we used a genome-skimming approach to infer phylogenetic relationships on the basis of 81 plastid genes and the highly repetitive rRNA (external transcribed spacer (ETS)–26S) for 39 accessions representing 34 species from eight sections. Maximum-likelihood analysis and Bayesian inference were performed separately for the nuclear and the plastid datasets. Divergence-time estimations were conducted on the basis of a relaxed molecular-clock model, using secondary calibration points. The phylogenetic analyses of the nuclear and plastid datasets yielded well resolved and supported phylogenies. Incongruences between the two datasets were detected, suggesting multiple hybridisation events or incomplete lineage sorting in the deeper and more recent evolutionary history of the genus. The inclusion of several known and suspected hybrids in the phylogenetic analysis provided insights into their parentage. Divergence-time estimations placed the crown diversification of Nepenthes in the early Miocene, c. 20 million years ago. This study showed that genome skimming provides well resolved nuclear and plastid phylogenies that provide valuable insights into the complex evolutionary relationships of Nepenthes.

  • PDF Download Icon
  • Preprint Article
  • 10.7287/peerj.preprints.27011v2
Rapid ecological specialization despite constant population sizes
  • Jul 24, 2018
  • Andrinajoro R Rakotoarivelo + 3 more

Background. The bushbuck, Tragelaphus scriptus, is the most widespread and ecologically diverse ungulate species complex within the spiral-horned antelopes. This species was recently found to consist of two genetically divergent but monophyletic Scriptus and Sylvaticus lineages, which are paraphyletic at mitochondrial (mt)DNA owing to an ancient interspecific hybridization event. The Scriptus lineage inhabits the north-western half of the African continent while Sylvaticus is found in the south-eastern half. Here we test hypotheses of historical demography and adaptation in bushbuck using a higher-resolution framework, with four nuclear (MGF, PRKCI, SPTBN, and THY) and three new mitochondrial markers (cytochrome b, 12S rRNA, and 16S rRNA). Methods. Genealogies were reconstructed for the mitochondrial and nuclear data sets, with the latter dated using fossil calibration points. We also inferred the demographic history of Scriptus and Sylvaticus using coalescent-based methods. To obtain an overview of the origins and ancestral colonisation routes of ancestral bushbuck sequences across geographic space, we conducted a discrete Bayesian phylogeographic analysis on our nuclear DNA data set. Results. Both nDNA and mtDNA support previously findings of two genetically divergent Sylvaticus and Scriptus lineages, with no allele sharing despite coming into secondary contact at several geographic locations. The three mtDNA loci confirmed 15 of the previously defined ecotypes, including those with convergent phenotypes. However, the nuclear tree showed less phylogenetic resolution at the more derived parts of the genealogy, possibly due to incomplete lineage sorting of the slower evolving nuclear genome. The only exception to this was the montane ecotype meneliki (Sylvaticus) of the Ethiopian highlands, which formed a monophyletic group at three of four nDNA loci. We dated the coalescence of the two lineages to a common ancestor ∼2.54 million years ago. Both marker sets revealed similar demographic histories of constant population size over time. We show that the bushbuck likely originated in North-East Africa, with Scriptus colonising suitable habitats towards west of the rift and Sylvaticus diffusing from east of the rift into southern Africa. Discussion. Despite lower levels of genetic structure at nuclear loci, we confirmed the independent evolution of the Menelik’s bushbuck relative to the phenotypically similar montane ecotypes in East Africa, adding further weight to previous suggestions of convergent evolution within the bushbuck complex. Perhaps the most surprising result of our analysis was that both Scriptus and Sylvaticus populations remained relatively constant throughout the Pleistocene, which is remarkable given that this was a period of major climatic and tectonic change in Africa, and responsible for driving the evolution of much of the continents extant large mammal diversity.

  • Dissertation
  • 10.31390/gradschool_theses.3348
Bridging Research and Education: A Look into the Evolutionary History of Batfishes and How Museum Exhibits Can be Used to Engage the Public
  • Nov 21, 2014
  • Valerie Derouen

The objective of this thesis was not only to contribute to our understanding of batfishes but to also show interesting and educational ways research can be presented to the general public. The first part of this project examines the evolutionary history of batfishes. Batfishes (Ogcocephalidae) are an understudied, group of marine anglerfishes that are dorsoventrally flattened and have an illicium and esca used to attract prey. Relationships among these taxa, as well as the position of Ogcocephalidae within Lophiiformes, remain poorly understood, with previous studies showing conflicting, and poorly resolved results. The timing of divergence and depth of origination in the water column have also not been explored in any detail. In this study, a concatenated nuclear and mitochondrial dataset was constructed across several anglerfish families to elucidate phylogenetic relationships among all batfish genera, to clarify the placement of Ogcocephaloidei within Lophiiformes, and to estimate divergence times using fossil calibrations. An ancestral state reconstruction was also conducted to examine the history of shifts in preferred habitat depths within batfishes. Phylogenetic analyses supported monophyly of each sub-order within Lophiiformes, and placed Ogcocephaloidei as the sister group to Antennarioidei. Batfish genera were divided into an Eastern Pacific/Western Atlantic clade and an Indo-Pacific clade; Halieutaea was recovered as the sister group to all other batfishes. Based on divergence time estimations and ancestral state reconstructions, Ogcocephalidae is Eocene/Paleocene in age and likely originated on the lower continental shelf/upper continental slope. To bring elements of the batfish research into public outreach, I contributed to the LSU Museum of Natural Science’s stand alone fish exhibit book, “Making a Big Splash with Louisiana Fishes,” which aimed to complement the concepts discussed in the exhibit in an engaging way. In addition to the text included in the exhibit, the book also incorporates ten kids’ activities, a short story for kids, ABCs of Louisiana, and a short guide to Louisiana fishes. Museum exhibits are a great way to incorporate research into public outreach. They not only enrich the surrounding community, but also contribute to an overall understanding of the importance and nature of scientific research.

  • Research Article
  • Cite Count Icon 38
  • 10.2307/2446495
Reappraising adaptive radiation
  • Nov 1, 1998
  • American Journal of Botany
  • Michael J Sanderson

Reappraising adaptive radiation

Save Icon
Up Arrow
Open/Close
Notes

Save Important notes in documents

Highlight text to save as a note, or write notes directly

You can also access these Documents in Paperpal, our AI writing tool

Powered by our AI Writing Assistant