Abstract

The diversity of biological and ecological characteristics of organisms, and the underlying genetic patterns and processes of speciation, makes the development of universally applicable genetic species delimitation methods challenging. Many approaches, like those incorporating the multispecies coalescent, sometimes delimit populations and overestimate species numbers. This issue is exacerbated in taxa with inherently high population structure due to low dispersal ability, and in cryptic species resulting from nonecological speciation. These taxa present a conundrum when delimiting species: analyses rely heavily, if not entirely, on genetic data which over split species, while other lines of evidence lump. We showcase this conundrum in the harvester Theromaster brunneus, a low dispersal taxon with a wide geographic distribution and high potential for cryptic species. Integrating morphology, mitochondrial, and sub-genomic (double-digest RADSeq and ultraconserved elements) data, we find high discordance across analyses and data types in the number of inferred species, with further evidence that multispecies coalescent approaches over split. We demonstrate the power of a supervised machine learning approach in effectively delimiting cryptic species by creating a “custom” training data set derived from a well-studied lineage with similar biological characteristics as Theromaster. This novel approach uses known taxa with particular biological characteristics to inform unknown taxa with similar characteristics, using modern computational tools ideally suited for species delimitation. The approach also considers the natural history of organisms to make more biologically informed species delimitation decisions, and in principle is broadly applicable for taxa across the tree of life.

Highlights

  • Organismal diversity is underpinned by diversity in life history and ecological characteristics among taxa, which in turn produce different underlying genetic patterns at the population and species levels [1,2,3,4,5]

  • Derkarabetian et al Frontiers in Zoology (2022) 19:8 multiple empirical studies have shown that commonly used multispecies coalescent (MSC) models can oversplit species level diversity in low dispersal taxa because such systems violate the underlying assumption of panmixia [9,10,11,12,13,14], a sentiment echoed in theoretical literature [15]

  • Regardless of ongoing debates (e.g., [16]), we argue that MSC implementations taken at face value, and with well-resolved genomic or sub-genomic data, have strong potential to inflate species numbers in low dispersal systems (e.g., [17])

Read more

Summary

Introduction

Organismal diversity is underpinned by diversity in life history and ecological characteristics among taxa, which in turn produce different underlying genetic patterns at the population and species levels [1,2,3,4,5]. The underlying diversity in speciation processes challenges the idea that any single genetic species delimitation model can be universally applicable. Derkarabetian et al Frontiers in Zoology (2022) 19:8 multiple empirical studies have shown that commonly used multispecies coalescent (MSC) models can oversplit species level diversity in low dispersal taxa because such systems violate the underlying assumption of panmixia [9,10,11,12,13,14], a sentiment echoed in theoretical literature [15]. Some lines of evidence cannot be feasibly studied, while others are clearly conservative This leads to a fundamental conundrum—how can we rigorously delimit species when genetic analyses are biased to inflate, and other evidence is inaccessible or lumps evolutionarily significant diversity?

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call