Studying pathogens degrades BLAST-based pathogen identification

Jacob Beal,Adam Clore,Jeff Manthey

doi:10.1038/s41598-023-32481-z

Jacob Beal, Adam Clore + Show 1 more

Open Access

https://doi.org/10.1038/s41598-023-32481-z

Copy DOI

Abstract

As synthetic biology becomes increasingly capable and accessible, it is likewise increasingly critical to be able to make accurate biosecurity determinations regarding the pathogenicity or toxicity of particular nucleic acid or amino acid sequences. At present, this is typically done using the BLAST algorithm to determine the best match with sequences in the NCBI nucleic acid and protein databases. Neither BLAST nor any of the NCBI databases, however, are actually designed for biosafety determination. Critically, taxonomic errors or ambiguities in the NCBI nucleic acid and protein databases can also cause errors in BLAST-based taxonomic categorization. With heavily studied taxa and frequently used biotechnology tools, even low frequency taxonomic categorization issues can lead to high rates of errors in biosecurity decision-making. Here we focus on the implications for false positives, finding that BLAST against NCBI’s protein database will now incorrectly categorize a number of commonly used biotechnology tool sequences as the pathogens or toxins with which they have been used. Paradoxically, this implies that problems are expected to be most acute for the pathogens and toxins of highest interest and for the most widely used biotechnology tools. We thus conclude that biosecurity tools should shift away from BLAST against general purpose databases and towards new methods that are specifically tailored for biosafety purposes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Apr 3, 2023
Citations: 9	License type: open-access

R Discovery Prime

R Discovery Prime

Studying pathogens degrades BLAST-based pathogen identification

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

A novel sequence similarity searching and visualization method based on overlappingly translated nucleic acids: the blastNP
Jan C Biro ... Josephine M.K Biro
Medical Hypotheses | VOL. 62
Jan C Biro, et. al.Jan C Biro ... Josephine M.K Biro
03 Feb 2004
Medical Hypotheses | VOL. 62

2 - Nucleic Acid and Protein Sequence Databases
Gary Williams
Genetic Databases | VOL. -
Gary WilliamsGary Williams
01 Jan 1997
Genetic Databases | VOL. -

Unbiased analysis by high throughput sequencing of the viral diversity in fetal bovine serum and trypsin used in cell culture
Léa Gagnieur ... Marc Eloit
Biologicals | VOL. 42
Léa Gagnieur, et. al.Léa Gagnieur ... Marc Eloit
22 Mar 2014
Biologicals | VOL. 42

Cold Adaptation of Zinc Metalloproteases in the Thermolysin Family from Deep Sea and Arctic Sea Ice Bacteria Revealed by Catalytic and Structural Properties and Molecular Dynamics
Bin-Bin Xie ... Yu-Zhong Zhang
Journal of Biological Chemistry | VOL. 284
Bin-Bin Xie, et. al.Bin-Bin Xie ... Yu-Zhong Zhang
01 Apr 2009
Journal of Biological Chemistry | VOL. 284

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Studying pathogens degrades BLAST-based pathogen identification

Abstract

Talk to us

Similar Papers

More From: Scientific Reports