2021
DOI: 10.1016/j.jbi.2021.103688
|View full text |Cite
|
Sign up to set email alerts
|

A scalable random walk with restart on heterogeneous networks with Apache Spark for ranking disease-related genes through type-II fuzzy data fusion

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 38 publications
0
5
0
Order By: Relevance
“…With the uptake of graph representation learning in biomedicine 27 , novel options exist to process networks alongside the input features in a joint ML model, thus approaching an in silico representation of biological regulation. First implementations based on random-walks [28][29][30][31][32][33][34][35][36][37] or graph neural networks (GNN) [38][39][40][41][42][43] show promise in predicting 'disease genes', but are often disease specific, depend on hard-coded and partially biased input data, and do not further explore the properties of predicted (core) genes (Extended Data Fig. 2).…”
Section: Note 1)mentioning
confidence: 99%
“…With the uptake of graph representation learning in biomedicine 27 , novel options exist to process networks alongside the input features in a joint ML model, thus approaching an in silico representation of biological regulation. First implementations based on random-walks [28][29][30][31][32][33][34][35][36][37] or graph neural networks (GNN) [38][39][40][41][42][43] show promise in predicting 'disease genes', but are often disease specific, depend on hard-coded and partially biased input data, and do not further explore the properties of predicted (core) genes (Extended Data Fig. 2).…”
Section: Note 1)mentioning
confidence: 99%
“…Table 1 reports a summary of the main features of the tools proposed in these articles. The table shows that apart from some tools that reports tests only on a multi-core workstation ( [16] , [17] , [18] , [19] ), Spark has been widely used to implement tools aimed at parallelizing the computation on a distributed computing environment. Most of these tools have been specifically devised for, or tested on, a cloud environment ( [20] , [21] , [22] , [23] , [24] , [25] , [26] , [27] , [28] [29] , [30] , [31] , [32] [33] , [34] , [35] , [36] , [37] ).…”
Section: Apache Spark In Life Sciencesmentioning
confidence: 99%
“…In 2021, Joodaki et al [19] proposed RWRHN-FF a novel algorithm aimed at finding genes involved in relevant phenotypes. Results obtained using gene interaction networks are often not accurate due to the high number of false positives.…”
Section: Apache Spark In Life Sciencesmentioning
confidence: 99%
“…Joodaki et al 32 integrated multiple protein/gene networks to overcome the false positive interaction prediction. They built a heterogeneous network based on gene-gene associations, disease–disease associations, and disease–gene associations.…”
Section: Related Workmentioning
confidence: 99%
“…As mentioned above, the current studies have several limitations, summarized in the following points. First, most studies have developed methods to predict the genes related to diseases, but a few of these methods were designed for PD gene prediction 18 22 , 32 , 35 . Second, some of these PD methods identified only protein genes related to PD and ignored lncRNA genes, although lncRNAs are critical for improving our understanding and diagnosing different diseases 17 , 23 – 25 .…”
Section: Related Workmentioning
confidence: 99%