2021
DOI: 10.1101/2021.06.03.446934
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale

Abstract: Even though automated functional annotation of genes represents a fundamental step in most genomic and metagenomic workflows, it remains challenging at large scales. Here, we describe a major upgrade to eggNOG-mapper, a tool for functional annotation based on precomputed orthology assignments, now optimized for vast (meta)genomic data sets. Improvements in version 2 include a full update of both the genomes and functional databases to those from eggNOG v5, as well as several efficiency enhancements and new fea… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
218
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 434 publications
(219 citation statements)
references
References 18 publications
1
218
0
Order By: Relevance
“…Rather than relying on single MAGs where the presence of specific genes can be questioned, mOTUpan can robustly quantify this presence as long as highly similar MAGs are available (which is often the case in mediumto-large scale metagenomic project). Notably, it can be used with a variety of genome-encoded traits, and the currently available version has parsers available for: Roary, PPanGGoLiN, eggNOGmapper [22], and mmseqs2 [21], with possibly more to be included later.…”
Section: Benchmarking Motupan Against Ppanggolin For a Prochlorococcus A Genomesetmentioning
confidence: 99%
“…Rather than relying on single MAGs where the presence of specific genes can be questioned, mOTUpan can robustly quantify this presence as long as highly similar MAGs are available (which is often the case in mediumto-large scale metagenomic project). Notably, it can be used with a variety of genome-encoded traits, and the currently available version has parsers available for: Roary, PPanGGoLiN, eggNOGmapper [22], and mmseqs2 [21], with possibly more to be included later.…”
Section: Benchmarking Motupan Against Ppanggolin For a Prochlorococcus A Genomesetmentioning
confidence: 99%
“…In order to investigate if other annotation methods provide better functional predictions, we scanned the sequences annotated to EC 1.1.3.15 with HAMAP and EggNOG predictors (S1 File). HAMAP [46] classifies and annotates proteins using a collection of expert-curated protein family signatures and annotation rules, while EggNOG [22] is a tool based on fast orthology assignments using precomputed clusters and phylogenies. Both methods provided predictions for only a portion of the input sequences (74% HAMAP, 59% EggNOG), indicating that for some of the sequences there was no evidence for either EC 1.1.3.15, or any other functional prediction.…”
Section: Investigation Of Alternative Functional Predictions For Sequences Annotated To Ec 11315 In the Brenda Databasementioning
confidence: 99%
“…Even more proteins with functions yet to be discovered might be hidden among the PLOS COMPUTATIONAL BIOLOGY misannotated sequences. The fields of systems biology [57], metabolic and enzyme engineering [58,59] also rely on accurate annotations, and improved methods for functional annotation are constantly being developed to meet their needs [20,22,60].…”
Section: Plos Computational Biologymentioning
confidence: 99%
See 1 more Smart Citation
“…The genomes were annotated in Prokka, after which the genes were organized using MCL algorithm into core, shell, and singleton clusters (Distance: Euclidean; Linkage: Ward). The core, shell, and singleton genes were separately annotated by BLASTp against the NCBI COG database using eggNOG-mapper (Cantalapiedra et al, 2021). Heatmap based on the annotated COG functions of the core and singleton gene clusters was then plotted in R. The Tettelin best-fit curves of the core-and pan-genomes were constructed using OMCL v1.4 implemented in GET_ HOMOLOGUES pipeline (Contreras-Moreira and Vinuesa, 2013).…”
Section: Pan-genome Analysismentioning
confidence: 99%