2008
DOI: 10.1002/prot.21989
|View full text |Cite
|
Sign up to set email alerts
|

An integrated approach to inferring gene–disease associations in humans

Abstract: One of the most important tasks of modern bioinformatics is the development of computational tools that can be used to understand and treat human disease. To date, a variety of methods have been explored and algorithms for candidate gene prioritization are gaining in their usefulness. Here, we propose an algorithm for detecting gene-disease associations based on the human protein-protein interaction network, known gene-disease associations, protein sequence, and protein functional information at the molecular … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
132
0

Year Published

2008
2008
2023
2023

Publication Types

Select...
9
1

Relationship

1
9

Authors

Journals

citations
Cited by 169 publications
(132 citation statements)
references
References 62 publications
0
132
0
Order By: Relevance
“…In which, the problem is considered as a classification one, where a classifier is learned from training data; then the learned classifier is used to predict whether or not a test/candidate gene is a disease gene. Briefly, at the early, machine learningbased studies usually approached disease gene prediction as a binary classification problem [9], where the learning samples are comprised of positive training samples and negative training samples [9] such as decision trees (DT) [10,11] k-nearest neighbor (kNN) [12], naive Bayesian classifier [13,14], binary support vector machine classifier [15][16][17], artificial neural network (ANN) techniques [18] and random forest (RF) [9]. In these binary classifier-based methods, positive training samples are constructed from known disease genes, whereas negative training samples are the remaining which are not known to be associated with diseases.…”
Section: Introductionmentioning
confidence: 99%
“…In which, the problem is considered as a classification one, where a classifier is learned from training data; then the learned classifier is used to predict whether or not a test/candidate gene is a disease gene. Briefly, at the early, machine learningbased studies usually approached disease gene prediction as a binary classification problem [9], where the learning samples are comprised of positive training samples and negative training samples [9] such as decision trees (DT) [10,11] k-nearest neighbor (kNN) [12], naive Bayesian classifier [13,14], binary support vector machine classifier [15][16][17], artificial neural network (ANN) techniques [18] and random forest (RF) [9]. In these binary classifier-based methods, positive training samples are constructed from known disease genes, whereas negative training samples are the remaining which are not known to be associated with diseases.…”
Section: Introductionmentioning
confidence: 99%
“…As benchmark data, we used the PPI network data in yeast from Collins et al [20] and also the one in human from Radivojac et al [21]. These networks contain one large connected component and a number of small components.…”
Section: Benchmark Data Setupmentioning
confidence: 99%
“…Protein-protein interaction (PPI) links connect pairs of genes in accordance with combined physical interaction data collected from HPRD, the Online Predicted Human Interaction Database (OPHID), and studies by Rual [7] and Stelzl [8]. Further details about these datasets, which are publicly available, can be found in [9].…”
Section: B Disease-gene Networkmentioning
confidence: 99%