2024
DOI: 10.1101/2024.05.29.596415
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Lineage-specific microbial protein prediction enables large-scale exploration of protein ecology within the human gut

Matthias Schmitz,
Nicholas J. Dimonaco,
Thomas Clavel
et al.

Abstract: Microbes use a range of genetic codes and gene structures, yet these are ignored during metagenomic analysis. This causes spurious protein predictions, preventing functional assignment which limits our understanding of ecosystems. To resolve this, we developed a lineage-specific gene prediction approach that uses the correct genetic code based on the taxonomic assignment of genetic fragments, removes partial predictions, and optimises prediction of small proteins. Applied to 9,634 metagenomes and 3,594 genomes… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 91 publications
0
1
0
Order By: Relevance
“…To uncover associations of these taxa with human health conditions, we studied the ecology of each protein within R . intestinale CLA-AA-H216 T ( Figure 3d ), as the most abundant novel taxa, and Blautia intestinihominis CLA-AA-H95 T ( Figure 3e ), as member of a genus associated with both health and disease conditions, using InvestiGUT 42 . Out of the 2,167 proteins encoded by R .…”
Section: Resultsmentioning
confidence: 99%
“…To uncover associations of these taxa with human health conditions, we studied the ecology of each protein within R . intestinale CLA-AA-H216 T ( Figure 3d ), as the most abundant novel taxa, and Blautia intestinihominis CLA-AA-H95 T ( Figure 3e ), as member of a genus associated with both health and disease conditions, using InvestiGUT 42 . Out of the 2,167 proteins encoded by R .…”
Section: Resultsmentioning
confidence: 99%