2023
DOI: 10.1038/s41598-023-32481-z
|View full text |Cite
|
Sign up to set email alerts
|

Studying pathogens degrades BLAST-based pathogen identification

Abstract: As synthetic biology becomes increasingly capable and accessible, it is likewise increasingly critical to be able to make accurate biosecurity determinations regarding the pathogenicity or toxicity of particular nucleic acid or amino acid sequences. At present, this is typically done using the BLAST algorithm to determine the best match with sequences in the NCBI nucleic acid and protein databases. Neither BLAST nor any of the NCBI databases, however, are actually designed for biosafety determination. Critical… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…The more accessible ( Wang and Zhang, 2019 ) and generally useful synbio potentially is regarded to be ( Sun et al, 2022 ), the less likely it is that prohibition will remain an effective tool. Decades-old bioinformatics resources originally developed to compare gene sequences, such as BLAST, have been re-used, with mixed results, as biosafety tools to identify pathogens ( Beal et al, 2023 ). Newer tools, such as machine learning-based topic models, enable spotting trends across a wide set of biosafety research publications ( Guan et al, 2022 ).…”
Section: Resultsmentioning
confidence: 99%
“…The more accessible ( Wang and Zhang, 2019 ) and generally useful synbio potentially is regarded to be ( Sun et al, 2022 ), the less likely it is that prohibition will remain an effective tool. Decades-old bioinformatics resources originally developed to compare gene sequences, such as BLAST, have been re-used, with mixed results, as biosafety tools to identify pathogens ( Beal et al, 2023 ). Newer tools, such as machine learning-based topic models, enable spotting trends across a wide set of biosafety research publications ( Guan et al, 2022 ).…”
Section: Resultsmentioning
confidence: 99%
“…Addressing this issue required weeks of SME effort to curate genomes for new target organisms and build exclusion lists for genomes that contained engineering. In addition, recent studies have highlighted the pitfalls of using general purpose sequence repositories such as NCBI to train screening systems . Consequently, new databases and methods are needed to aggregate and curate natural sequence data for the purposes of biosecurity.…”
Section: Resultsmentioning
confidence: 99%
“…This sequence analysis module is based on techniques from malware detection that were later adapted for pathogenic DNA screening as part of BBN’s FAST-NA tool . BGAF includes whole-genome assembly with Abyss and Prymetime (our custom hybrid assembly pipeline) taxonomic classification and read mapping with FAST-NA, and annotation with BLAST.…”
Section: Methodsmentioning
confidence: 99%
“…Other tools (eg, sequence databases) may have been created for other purposes and may need to be reconfigured or curated for use in biosecurity regimes. 27 Some tools needed by biosecurity regimes are not yet fully developed (eg, algorithms to identify harmful functions from primary DNA or protein sequences). When developing tools, it is vital to take into account the need to avoid unduly impeding biological research and development and to manage any information hazards generated.…”
Section: A More Structured Approach To Designing Biosecurity Regimesmentioning
confidence: 99%
“…Commercial providers of DNA synthesis have noted the challenges of implementing taxonomy-based regulations for sequence-based products. 26 , 27 Ascertaining whether a given order matches something to be controlled is difficult even for the most qualified companies. Taxonomic lists have also been used as a tool to narrow oversight of relevant biological research.…”
Section: Introductionmentioning
confidence: 99%