Abstract

Researchers in ecology commonly use multivariate analyses (e.g. redundancy analysis, canonical correspondence analysis, Mantel correlation, multivariate analysis of variance) to interpret patterns in biological data and relate these patterns to environmental predictors. There has been, however, little recognition of the errors associated with biological data and the influence that these may have on predictions derived from ecological hypotheses. We present a permutational method that assesses the effects of taxonomic uncertainty on the multivariate analyses typically used in the analysis of ecological data. The procedure is based on iterative randomizations that randomly re-assign non identified species in each site to any of the other species found in the remaining sites. After each re-assignment of species identities, the multivariate method at stake is run and a parameter of interest is calculated. Consequently, one can estimate a range of plausible values for the parameter of interest under different scenarios of re-assigned species identities. We demonstrate the use of our approach in the calculation of two parameters with an example involving tropical tree species from western Amazonia: 1) the Mantel correlation between compositional similarity and environmental distances between pairs of sites, and; 2) the variance explained by environmental predictors in redundancy analysis (RDA). We also investigated the effects of increasing taxonomic uncertainty (i.e. number of unidentified species), and the taxonomic resolution at which morphospecies are determined (genus-resolution, family-resolution, or fully undetermined species) on the uncertainty range of these parameters. To achieve this, we performed simulations on a tree dataset from southern Mexico by randomly selecting a portion of the species contained in the dataset and classifying them as unidentified at each level of decreasing taxonomic resolution. An analysis of covariance showed that both taxonomic uncertainty and resolution significantly influence the uncertainty range of the resulting parameters. Increasing taxonomic uncertainty expands our uncertainty of the parameters estimated both in the Mantel test and RDA. The effects of increasing taxonomic resolution, however, are not as evident. The method presented in this study improves the traditional approaches to study compositional change in ecological communities by accounting for some of the uncertainty inherent to biological data. We hope that this approach can be routinely used to estimate any parameter of interest obtained from compositional data tables when faced with taxonomic uncertainty.

Highlights

  • Researchers commonly use multivariate analyses to interpret patterns in species data and relate these patterns to environmental predictors

  • We introduce a method that incorporates the effect of taxonomic uncertainty in the estimation of any parameter of interest obtained from multivariate techniques (e.g. Mantel correlation coefficient or explained variance in Redundancy analysis (RDA), CCA or non-parametric multivariate analysis of variance)

  • We showed how taxonomic uncertainty affected the calculation of two parameters: 1) the Mantel correlation between compositional similarity and environmental distances between pairs of sites, and; 2) the variance explained by environmental predictors in redundancy analysis (RDA)

Read more

Summary

The method

The method outlined here describes a general approach to account for taxonomic uncertainty when computing any parameter of interest from biological data tables This is done by estimating credible bounds under plausible scenarios of re-assigned species identities. Note that when collating inventories are from different researchers, we must rename all unidentified species This is because two researchers can use the same label, e.g. E_spl, even though this name does not necessarily refer to the same species. N matrices are obtained, all of which contain the same number of sites but a variable number of species depending on the resulting re-assignment of morphospecies Multivariate analyses such as non-parametric multivariate analysis of variance, Mantel test, CCA or RDA, can be applied to each of these matrices of species per sites, provided that a matrix of explanatory variables is available. Application of any of these analyses to the n matrices computed in the former steps will allow the estimation of n parameters of interest, allowing calculation of credible bounds for such parameters

Implementation of the procedure in two multivariate analyses
Description of the dataset
Mantel test
Setting the scenarios
Correlation between distance matrices
Results
Taxonomic uncertainty Taxonomic resolution Residuals
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call