2021
DOI: 10.1093/bioadv/vbab025
|View full text |Cite
|
Sign up to set email alerts
|

MUNDO: protein function prediction embedded in a multispecies world

Abstract: Motivation Leveraging cross-species information in protein function prediction can add significant power to network-based protein function prediction methods, because so much functional information is conserved across at least close scales of evolution. We introduce MUNDO, a new cross-species co-embedding method that combines a single network embedding method with a co-embedding method to predict functional annotations in a target species, leveraging also functional annotations in a model spe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 39 publications
0
1
0
Order By: Relevance
“…Proteins that have similar functions and very similar protein-protein interactions tend to be evolutionarily conserved across species. By creating a shared embedding space that captures the functional and interaction similarities between proteins within and across species, we can transfer functional annotations from well-studied proteins in one species to their functionally similar counterparts in other species 91 . While embedding-based models have shown promise in leveraging multiple data sources for protein function prediction, they have primarily relied on supervised learning approaches that transfer labels from proteins with known functions to unlabeled ones.…”
Section: Resultsmentioning
confidence: 99%
“…Proteins that have similar functions and very similar protein-protein interactions tend to be evolutionarily conserved across species. By creating a shared embedding space that captures the functional and interaction similarities between proteins within and across species, we can transfer functional annotations from well-studied proteins in one species to their functionally similar counterparts in other species 91 . While embedding-based models have shown promise in leveraging multiple data sources for protein function prediction, they have primarily relied on supervised learning approaches that transfer labels from proteins with known functions to unlabeled ones.…”
Section: Resultsmentioning
confidence: 99%
“…Overall, our work demonstrates that we can create effective joint representations of molecular networks from multiple species that can be easily reused for any within-and cross-species gene classification task. We propose GenePlexusZoo as a general framework where any previous [19][20][21][22], or future approaches for joint network embedding and learning can be adapted to take the place of our implementation to continue improving cross-species network-based gene classification.…”
Section: Plos Computational Biologymentioning
confidence: 99%
“…Some of these methods directly find similar functional modules in the networks of each species; either through graph alignment [ 17 ] or the use of gene families [ 18 ]. Methods such as MUNK [ 19 ], MUNDO [ 20 ], and ETNA [ 21 ] cast genes across pairs of species ( e . g ., human and mouse) into a joint low dimensional embedding space, anchoring the embedding space together with one-to-one-orthologs.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation