Abstract

The expectation of published sequence data on Genbank and other sequence databases is not only a requirement for journal publication, but is incredibly valuable for continuing research in organismal evolution and ecology. The process of formatting and submitting sequence data from vouchered specimens is burdensome and tedious. Improperly formatted submissions lead to lengthy delays and frustrations on behalf of the submitters and reviewers. Specimods*1 is a new tool that utilizes the Global Biodiversity Information Facility (GBIF) API to produce formatted source modifier files that will facilitate the batch upload of specimen sequences to Genbank, using the sequence submission platform. Once a template CSV is populated with the specimen catalog numbers, institution codes, SeqID, sequence descriptions, and the sequences, Specimods will pull specimen metadata from GBIF to produce two files: a FASTA file with a complete definition line (SeqID, organism name, sequence title) and the sequences as well as a source modifier file containing specimen metadata. Currently, Specimods supports Internal Transcribed Spacer (ITS) (1, 2, or both), SSUrRNA_18s, and LSUrRNA_28s sequences. The application*2 uses an Express server and Node.js with data linked to a user account stored in a MySQL relational database. This presentation will demonstrate the tool using data from fungal specimens and show how submitting dozens of sequences with metadata can be accomplished in a matter of minutes.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call