2022
DOI: 10.1093/nar/gkac1080
|View full text |Cite
|
Sign up to set email alerts
|

MGnify: the microbiome sequence data analysis resource in 2023

Abstract: The MGnify platform (https://www.ebi.ac.uk/metagenomics) facilitates the assembly, analysis and archiving of microbiome-derived nucleic acid sequences. The platform provides access to taxonomic assignments and functional annotations for nearly half a million analyses covering metabarcoding, metatranscriptomic, and metagenomic datasets, which are derived from a wide range of different environments. Over the past 3 years, MGnify has not only grown in terms of the number of datasets contained but also increased t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
128
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 172 publications
(128 citation statements)
references
References 29 publications
(33 reference statements)
0
128
0
Order By: Relevance
“…Sequence dataset of Class II effector proteins Initial datasets of Cas12 and Cas9 were taken from (a recently accepted paper, to be cited upon publication) and (17), respectively. Cas13 sequence dataset was constructed by building HMMER (25) sequence profiles for Cas13 groups (26) and using them to search NR (27), UniRef100 (28), MGnify (29), and IMG/VR v4 (30) databases with hmmsearch (25). Only sequences with E-value ≤ 1e-20 were extracted.…”
Section: Methodsmentioning
confidence: 99%
“…Sequence dataset of Class II effector proteins Initial datasets of Cas12 and Cas9 were taken from (a recently accepted paper, to be cited upon publication) and (17), respectively. Cas13 sequence dataset was constructed by building HMMER (25) sequence profiles for Cas13 groups (26) and using them to search NR (27), UniRef100 (28), MGnify (29), and IMG/VR v4 (30) databases with hmmsearch (25). Only sequences with E-value ≤ 1e-20 were extracted.…”
Section: Methodsmentioning
confidence: 99%
“…New features of the former include an improved genome context viewer while the latter has fresh tools for detection of metagenome-derived viral genomes and prediction of their hosts. The update from the popular MGnify resource for metagenomics data ( 52 ), recognising the scale and continued growth of the database, reports interestingly on the adoption of Deep Learning methods to annotate protein sequences with Pfam families ( 53 ). The importance of the microbiome to the colonised host is demonstrated by two databases.…”
Section: New and Updated Databasesmentioning
confidence: 99%
“…For example, the latest release of the UniProtKB database, containing 227M sequences, expands the 2019 version by 90% [30,7]. Metagenomic databases grow even faster: the EMBL-EBI MGnify, including hundreds of millions of sequences produced from high-quality assembled genomes, reported a 48-fold growth since its first release in 2017 [24] [19].…”
Section: Background and Summarymentioning
confidence: 99%
“…Furthermore, the proliferation of metagenomic projects will likely induce a further drop in Pfam coverage. In fact, family annotation levels of metagenomic protein sequence databases are low, typically around 46% [24].…”
Section: Background and Summarymentioning
confidence: 99%