2022
DOI: 10.1128/msphere.00077-22
|View full text |Cite
|
Sign up to set email alerts
|

Comparative Analysis and Data Provenance for 1,113 Bacterial Genome Assemblies

Abstract: The traceability of microbial genomics data to authenticated physical biological materials is not a requirement for depositing these data into public genome databases. This creates significant risks for the reliability and data provenance of these important genomics research resources, the impact of which is not well understood.

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 49 publications
1
9
0
Order By: Relevance
“…The treatments containing only U- 12 C Glucose or U- 12 C Glycerol yielded no previously undescribed metabolic phenotypes. We therefore focused our analysis on growth media containing both carbon sources: U- 12 C Glucose and U- 12 C Glycerol, U- 12 C Glucose and U- 13 C Glycerol, or U- 13 C Glucose and U- compositions, enabling the determination of the isotope distribution within the cell during growth.…”
Section: Microbial Growth and Sample Preparation Thementioning
confidence: 87%
See 2 more Smart Citations
“…The treatments containing only U- 12 C Glucose or U- 12 C Glycerol yielded no previously undescribed metabolic phenotypes. We therefore focused our analysis on growth media containing both carbon sources: U- 12 C Glucose and U- 12 C Glycerol, U- 12 C Glucose and U- 13 C Glycerol, or U- 13 C Glucose and U- compositions, enabling the determination of the isotope distribution within the cell during growth.…”
Section: Microbial Growth and Sample Preparation Thementioning
confidence: 87%
“…Although this strategy works well in species that are closely related to these model organisms, it is less effective when applied to species that are highly divergent from the original model . Mismatches due to poor homology result in missing enzymes in the metabolic network, which, without further data refinement, can be misinterpreted as metabolic deficiencies. , This is a critical problem for understanding the evolution of microbes and for making inferences about the metabolic architecture of nonmodel organisms.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…Validation of the fungal GAMBIT database using the ATCC Mycology Collection genomes was performed using the Gambit_ Query workflow developed by Theiagen Genomics on Terra. 1 All available fungal genomes were downloaded from the ATCC genome 1 https://github.com/theiagen/public_health_bioinformatics/blob/ PHB-v0.1.0-theiaeuk-manuscript/workflows/standalone_modules/wf_gambit_ query.wdl portal on 2023-03-08 (46)(47)(48). ATCC genomes downloaded from the ATCC genome portal were used exclusively for testing and were not included in the GAMBIT fungal database.…”
Section: Atcc Mycology Genomesmentioning
confidence: 99%
“…ATCC has thus far produced over 4,000 high-quality or closed reference genomes for microbes within the ATCC collection, all under an ISO 9000 controlled quality assurance framework. This presents some dilemmas, however, as the NCBI Assembly database includes (for example) genome references for bacterial strains that have serious gaps in metadata or include substantial errors in their genome assembly when compared to the ATCC Genome Portal reference (17). Reducing discrepancies between genome references for the “same” organism can be aided by improving our ability to include crucial metadata about the origins of and means by which each genome reference is created in-line with the sequence data itself.…”
Section: Introductionmentioning
confidence: 99%