FloraTraiter: Automated parsing of traits from descriptive biodiversity literature.

Ryan A Folk, Robert P Guralnick, Raphael T Lafrance

doi:10.1002/aps3.11563


Open Access

https://doi.org/10.1002/aps3.11563


Abstract

Plant trait data are essential for quantifying biodiversity and function across Earth, but these data are challenging to acquire for large studies. Diverse strategies are needed, including the liberation of heritage data locked within specialist literature such as floras and taxonomic monographs. Here we report FloraTraiter, a novel approach using rule-based natural language processing (NLP) to parse computable trait data from biodiversity literature. FloraTraiter was implemented through collaborative work between programmers and botanical experts and customized for both online floras and scanned literature. We report a strategy spanning optical character recognition, recognition of taxa, iterative building of traits, and establishing linkages among all of these, as well as curational tools and code for turning these results into standard morphological matrices. Over 95% of treatment content was successfully parsed for traits with <1% error. Data for more than 700 taxa are reported, including a demonstration of common downstream uses. We identify strategies, applications, tips, and challenges that we hope will facilitate future similar efforts to produce large open-source trait data sets for broad community reuse. Largely automated tools like FloraTraiter will be an important addition to the toolkit for assembling trait data at scale.
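To make the idea of rule-based trait parsing concrete, the following is a minimal sketch of one hand-written rule that pulls a size range for a plant part out of descriptive text. It is not FloraTraiter's code or API: the vocabulary `PART_LEMMAS`, the `SizeTrait` record, the `SIZE_RULE` pattern, and the function `parse_size_traits` are all hypothetical names chosen for illustration, and a plain regular expression stands in for whatever rule machinery the authors actually use.

```python
import re
from dataclasses import dataclass

# Hypothetical vocabulary: surface forms mapped to a normalized plant part.
PART_LEMMAS = {
    "leaves": "leaf", "leaf": "leaf",
    "petals": "petal", "petal": "petal",
    "sepals": "sepal", "sepal": "sepal",
}

# Conversion factors to millimetres, so all sizes share one unit.
UNIT_TO_MM = {"mm": 1, "cm": 10, "m": 1000}


@dataclass(frozen=True)
class SizeTrait:
    """A size range for one plant part (illustrative record, not a standard)."""
    part: str
    low_mm: float
    high_mm: float


# One illustrative rule: a part name followed, within the same clause
# (no '.' or ';' in between), by a numeric range and a length unit.
SIZE_RULE = re.compile(
    r"\b(?P<part>" + "|".join(PART_LEMMAS) + r")\b"
    r"[^.;]*?"
    r"(?P<low>\d+(?:\.\d+)?)\s*(?:-|–|to)\s*"
    r"(?P<high>\d+(?:\.\d+)?)\s*"
    r"(?P<unit>mm|cm|m)\b",
    re.IGNORECASE,
)


def parse_size_traits(text):
    """Return every size range the rule finds, normalized to millimetres."""
    traits = []
    for match in SIZE_RULE.finditer(text):
        factor = UNIT_TO_MM[match.group("unit").lower()]
        traits.append(
            SizeTrait(
                part=PART_LEMMAS[match.group("part").lower()],
                low_mm=float(match.group("low")) * factor,
                high_mm=float(match.group("high")) * factor,
            )
        )
    return traits


# Example input written for this sketch, not taken from a real treatment.
description = "Leaves 3-7 cm long; petals white, 5–8 mm."
traits = parse_size_traits(description)
```

On the sample sentence this yields a leaf range of 30 to 70 mm and a petal range of 5 to 8 mm. A real pipeline of the kind the abstract describes would need many such rules, a much larger vocabulary, and a further step linking each trait to the taxon it describes.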


Journal: Applications in Plant Sciences
Publication Date: Jan 1, 2024
License type: CC BY-NC 4.0

Similar Papers

Scientific floras can be reliable sources for some trait data in a system with poor coverage in global trait databases
Vanessa Cutts ... Adam C Algar
Journal of Vegetation Science | VOL. 32
01 May 2021

Toward a Functional Trait Approach to Bee Ecology.
Madeleine M Ostwald ... Katja C Seltmann
Ecology and Evolution | VOL. 14
01 Oct 2024

Discussion of the Method for Constructing Animal Traits
Jiangning Wang ... Yan Han
Biodiversity Information Science and Standards | VOL. 2
25 Apr 2018

Bridging gaps in demographic analysis with phylogenetic imputation.
Tamora D James ... Dylan Z Childs
Conservation Biology | VOL. 35
21 Jan 2021
