Specimods: A web-based tool for producing Genbank submission files for sequenced museum specimens

Biodiversity Information Science and Standards(2022)

Cited 0|Views2
No score
Abstract
The expectation of published sequence data on Genbank and other sequence databases is not only a requirement for journal publication, but is incredibly valuable for continuing research in organismal evolution and ecology. The process of formatting and submitting sequence data from vouchered specimens is burdensome and tedious. Improperly formatted submissions lead to lengthy delays and frustrations on behalf of the submitters and reviewers. Specimods*1 is a new tool that utilizes the Global Biodiversity Information Facility (GBIF) API to produce formatted source modifier files that will facilitate the batch upload of specimen sequences to Genbank, using the sequence submission platform. Once a template CSV is populated with the specimen catalog numbers, institution codes, SeqID, sequence descriptions, and the sequences, Specimods will pull specimen metadata from GBIF to produce two files: a FASTA file with a complete definition line (SeqID, organism name, sequence title) and the sequences as well as a source modifier file containing specimen metadata. Currently, Specimods supports Internal Transcribed Spacer (ITS) (1, 2, or both), SSUrRNA_18s, and LSUrRNA_28s sequences. The application*2 uses an Express server and Node.js with data linked to a user account stored in a MySQL relational database. This presentation will demonstrate the tool using data from fungal specimens and show how submitting dozens of sequences with metadata can be accomplished in a matter of minutes.
More
Translated text
Key words
sequenced museum specimens,genbank submission files,web-based
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined