Taxonomy has long been a cornerstone issue in the paleontological collections community, especially as digitization has enabled us to share data at a global scale. Communities of practice such as the Paleo Data Working Group (PDWG, Krimmel et al. 2021) have made concerted efforts to understand and evaluate challenges with the taxonomic information published via fossil occurrence records (Little et al. 2021, Little et al. 2022). Since 2020, PDWG has been leading discussions to define the nuances of challenges associated with fossil taxonomy, to look at the landscape of cyberinfrastructure tools available to tackle these challenges, and to develop community guidelines. This work has led to a deeper understanding of the full range of challenges surrounding the use of fossil taxonomy, from local practices and systems to representation at the data aggregator level. Here we share our initial strategy for tackling challenges with fossil data, particularly taxonomy, centered on developing a reusable framework for assessing and addressing issues so that data can be made more discoverable and usable for transformative and translational research. An essential element of this approach is the coordination of efforts across stakeholder communities. Through these collaborations, we ultimately seek to improve systems for handling and curating taxonomic data, increase the quality and discoverability of taxonomic data presented by aggregators, and better integrate data provided by fossil occurrence records into the research lifecycle. The required coordination will be driven through the development of a partner network from across the global bio- and geo- science data ecosystem. This network is composed of representatives with a variety of perspectives and roles, including participation from Catalogue of Life, Global Biodiversity Information Facility, and Biodiversity Information Standards (TDWG), as well as paleontology collections professionals (led by PDWG), informaticians, researchers, and other end-users. Integrating the existing PDWG community of practice into this larger network allows us to learn from work already underway in the landscape. In addition, recognizing that the paleo community is not alone in these challenges, we believe that a successful strategy for this network is to create processes and envision solutions that can be adapted by other disciplines with similar needs. Our strategy underscores the need for the combined efforts of both community engagement and evaluation of the existing technical landscape. Through this work we aim to establish an enhanced network of FAIR (Findable, Accessible, Interoperable, Reusable; Wilkinson et al. 2016) and research-ready fossil data, improving the support for and quality of fossil taxonomy data along the way.
Read full abstract