A simple way to improve multivariate analyses of paleoecological data sets

John Alroy

doi:10.1017/pab.2014.21

Abstract

Multivariate methods such as cluster analysis and ordination are basic to paleoecology, but the messy nature of fossil occurrence data often makes it difficult to recover clear patterns. A recently described faunal similarity index based on the Forbes coefficient improves results when its complement is employed as a distance metric. This index involves adding terms to the Forbes equation and ignoring one of the counts it employs (that of species found in neither of the samples under consideration). Analyses of simulated data matrices demonstrate its advantages. These matrices include large and small samples from two partially overlapping species pools. In a cluster analysis, the widely used Dice coefficient and the Euclidean distance metric both create groupings that reflect sample size, the Simpson index suggests large differences that do not exist, and the corrected Forbes index creates groupings based strictly on true faunal overlap. In a principal coordinates analysis (PCoA) the Forbes index almost removes the sample-size signal but other approaches create a second axis strongly dominated by sample size. Meanwhile, species lists of late Pleistocene mammals from the United States capture biogeographic signals that standard ordination methods do recover, but the adjusted Forbes coefficient spaces the points out more sensibly. Finally, when biome-scale lists for living mammals are added to the data set and extinct species are removed, correspondence analysis misleadingly separates out the biome lists, and PCoA based on the Dice coefficient places them to the edge of the cloud of fossil assemblage data points. PCoA based on the Forbes index places them in more reasonable positions. Thus, only the adjusted Forbes index is able to recover true biological patterns. These results suggest that the index may be useful in analyzing not only paleontological data sets but any data set that includes species lists having highly variable lengths.
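
The sample-size effect described above can be seen on a toy case. The sketch below is an illustration only, not the paper's implementation. It uses the standard definitions of the Dice and Simpson coefficients, and the basic Forbes coefficient with the count of species found in neither sample dropped. The additional correction terms of the adjusted index are not given in the abstract, so they are deliberately left out. The function names and the toy species lists are hypothetical.

```python
def overlap_counts(sample_x, sample_y):
    """Return (a, b, c): shared species, species only in x, species only in y."""
    x, y = set(sample_x), set(sample_y)
    return len(x & y), len(x - y), len(y - x)


def dice(a, b, c):
    # Standard Dice coefficient: 2a / (2a + b + c).
    return 2 * a / (2 * a + b + c)


def simpson(a, b, c):
    # Standard Simpson coefficient: a / (a + min(b, c)).
    return a / (a + min(b, c))


def forbes_without_joint_absences(a, b, c):
    # Basic Forbes coefficient a*n / ((a + b)(a + c)), where n is taken as
    # a + b + c, i.e. the jointly absent count is ignored.
    # ASSUMPTION: this is the uncorrected form; the paper's extra terms are omitted.
    n = a + b + c
    return a * n / ((a + b) * (a + c))


# Toy case (hypothetical data): a small list fully nested inside a large one.
# Both samples are assumed non-empty, so no denominator is zero.
large_sample = range(40)   # 40 species
small_sample = range(10)   # 10 species, all also present in the large sample

a, b, c = overlap_counts(large_sample, small_sample)
distances = {
    "dice": 1 - dice(a, b, c),
    "simpson": 1 - simpson(a, b, c),
    "forbes_without_joint_absences": 1 - forbes_without_joint_absences(a, b, c),
}
for name, value in distances.items():
    print(f"{name}: {value:.2f}")
```

In this nested case the Dice distance is large purely because the two lists differ in length, while the Simpson and Forbes-style distances are zero. The abstract's further point, that the Simpson index can suggest differences that do not exist, depends on sampling noise that this toy case does not simulate.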

Journal: Paleobiology
Publication Date: Feb 24, 2015
Citations: 13