2021
DOI: 10.1186/s13040-021-00239-w
|View full text |Cite
|
Sign up to set email alerts
|

Gene function finding through cross-organism ensemble learning

Abstract: Background Structured biological information about genes and proteins is a valuable resource to improve discovery and understanding of complex biological processes via machine learning algorithms. Gene Ontology (GO) controlled annotations describe, in a structured form, features and functions of genes and proteins of many organisms. However, such valuable annotations are not always reliable and sometimes are incomplete, especially for rarely studied organisms. Here, we present GeFF (Gene Functi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 73 publications
(91 reference statements)
0
1
0
Order By: Relevance
“…The community should envisage novel graph representation learning methods [ 57 , 58 , 59 , 60 ] to densely represent multi-relational structured data following a Linked Open Data vision centered on the integration of several source knowledge graphs or relational databases via automatic entity matching [ 61 ]. Taking inspiration from biology [ 62 , 63 ] and communication networks [ 64 , 65 , 66 , 67 ], we underline the importance of managing dynamic scenarios, tracking knowledge refinements among sentences, and propagating information, which is pivotal when processing lengthy inputs. Segmentation strategies and memory-enhanced encoder–decoder transformers could be inspected in other downstream tasks with long documents and cross-dependencies among chunks, such as claim verification with evidence retrieval [ 68 , 69 ].…”
Section: Limitations and Future Directionsmentioning
confidence: 99%
“…The community should envisage novel graph representation learning methods [ 57 , 58 , 59 , 60 ] to densely represent multi-relational structured data following a Linked Open Data vision centered on the integration of several source knowledge graphs or relational databases via automatic entity matching [ 61 ]. Taking inspiration from biology [ 62 , 63 ] and communication networks [ 64 , 65 , 66 , 67 ], we underline the importance of managing dynamic scenarios, tracking knowledge refinements among sentences, and propagating information, which is pivotal when processing lengthy inputs. Segmentation strategies and memory-enhanced encoder–decoder transformers could be inspected in other downstream tasks with long documents and cross-dependencies among chunks, such as claim verification with evidence retrieval [ 68 , 69 ].…”
Section: Limitations and Future Directionsmentioning
confidence: 99%