2018
DOI: 10.1093/nar/gky977
|View full text |Cite
|
Sign up to set email alerts
|

Genomes OnLine database (GOLD) v.7: updates and new features

Abstract: The Genomes Online Database (GOLD) (https://gold.jgi.doe.gov) is an open online resource, which maintains an up-to-date catalog of genome and metagenome projects in the context of a comprehensive list of associated metadata. Information in GOLD is organized into four levels: Study, Biosample/Organism, Sequencing Project and Analysis Project. Currently GOLD hosts information on 33 415 Studies, 49 826 Biosamples, 313 324 Organisms, 215 881 Sequencing Projects and 174 454 Analysis Projects with a total of 541 met… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
145
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
3
2
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 187 publications
(146 citation statements)
references
References 29 publications
1
145
0
Order By: Relevance
“…If the VR coverage was below this cutoff, it was considered as significantly lower than expected, the TR-VR was colored in blue in this plot, and flagged as "low coverage" if no SNVs were detected in Fig Supplementary Table 1: List of metagenomes mined for DGRs. Metagenomes are associated with samples, ecological category, sample type, and publication based on information from IMG 31 and Gold 32 . The number of genomes in each assembly was estimated based on single-copy marker genes, while the ratio of bp in viral sequences among contigs of 10kb or more was derived from VirSorter detection of viral contigs (see Methods).…”
Section: Figure 2 Diversity and Major Types Of Dgr Targetsmentioning
confidence: 99%
See 1 more Smart Citation
“…If the VR coverage was below this cutoff, it was considered as significantly lower than expected, the TR-VR was colored in blue in this plot, and flagged as "low coverage" if no SNVs were detected in Fig Supplementary Table 1: List of metagenomes mined for DGRs. Metagenomes are associated with samples, ecological category, sample type, and publication based on information from IMG 31 and Gold 32 . The number of genomes in each assembly was estimated based on single-copy marker genes, while the ratio of bp in viral sequences among contigs of 10kb or more was derived from VirSorter detection of viral contigs (see Methods).…”
Section: Figure 2 Diversity and Major Types Of Dgr Targetsmentioning
confidence: 99%
“…vContact2 was run with "diamond" option to generate the PCs, clustering of VCs with cluster_one, and the reference database "ProkaryoticViralRefSeq94-Merged", all other parameters left as default. Metagenome-derived DGRs were also associated to a biome type based on the original sample classification available in the Gold database 32 ( Supplementary Table 1).…”
Section: Selection and Annotation Of Reference Genomes And Metagenomementioning
confidence: 99%
“…In this study, we analyzed all protein-coding genes containing at least one pfam annotated in the genomes of 435 species, with a total of 8,209 pfams identified (see Methods). Eukaryotic species were included in our dataset if they were marked "Complete" by GOLD (Mukherjee et al 2019), and also present in TimeTree (Hedges et al 2006), a phylogenetic database which we used to assign evolutionary ages to pfams. The genes in our dataset were then assigned ages based on the oldest pfam they contained.…”
Section: Resultsmentioning
confidence: 99%
“…We compiled a dataset of whole-genome sequences with a sequencing status of 'complete' from the GOLD database (Mukherjee et al 2019, accessed August 7, 2018, resulting in a list of 1138 unique species. We then downloaded species from the Ensembl Biomart interface databases (Kinsella et al 2011;Zerbino et al 2018Zerbino et al , fungi V40, plant V40, metazoan V40, protest v40 and main V93, accessed between 07/31/2018Zerbino et al -09/12/2018.…”
Section: Data Compilationmentioning
confidence: 99%
“…Additional biological information such as gene-protein-reaction associations is utilized to refine the models. Exponential growth in sequencing has resulted in an "astronomical", or better yet, "genomical", number of sequenced organisms [15]. There are now databases (e.g., KEGG [16], BioCyc [17], and BiGG [18]) that catalogue organism-specific metabolic models.…”
Section: Introductionmentioning
confidence: 99%