2010
DOI: 10.1186/1471-2164-11-s5-s9
|View full text |Cite
|
Sign up to set email alerts
|

A machine learning approach for genome-wide prediction of morbid and druggable human genes based on systems-level data

Abstract: BackgroundThe genome-wide identification of both morbid genes, i.e., those genes whose mutations cause hereditary human diseases, and druggable genes, i.e., genes coding for proteins whose modulation by small molecules elicits phenotypic effects, requires experimental approaches that are time-consuming and laborious. Thus, a computational approach which could accurately predict such genes on a genome-wide scale would be invaluable for accelerating the pace of discovery of causal relationships between genes and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
62
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 70 publications
(63 citation statements)
references
References 53 publications
1
62
0
Order By: Relevance
“…Similarly, gene–disease association data has been used before to discover new genes with important roles in disease, with precision estimates ranging from 0.61 to 0.84 [64, 7173]. Interestingly, this is another scenario where sourcing unambiguous negative examples is challenging and has often been framed as a PU learning problem [7173].…”
Section: Discussionmentioning
confidence: 99%
“…Similarly, gene–disease association data has been used before to discover new genes with important roles in disease, with precision estimates ranging from 0.61 to 0.84 [64, 7173]. Interestingly, this is another scenario where sourcing unambiguous negative examples is challenging and has often been framed as a PU learning problem [7173].…”
Section: Discussionmentioning
confidence: 99%
“…PROSPECTR 27 uses 23 sequence-based features and predicts disease genes from OMIM with precision = 0.62 and recall = 0.70 with an AUC of 0.70. The most directly comparable method, presented in Costa et al , 18 utilizes topological features of gene interaction networks to predict both morbidity genes (P=0.66, R=0.65, AUC=0.72) and druggable genes (P=0.75, R=0.78, AUC=0.82). While the majority of other methods utilize sequence-based features, protein interactions, and other genomic networks, our method requires only Gene Ontology annotations and simple bigrams/collocations extracted from biomedical literature.…”
Section: Resultsmentioning
confidence: 99%
“…We also looked up our predicted genes in the results from a previous study on predicting morbid and druggable genes, and 90% (9 out of 10) of our predicted pharmacogenes were also predicted to be morbid (variations cause hereditary human diseases) or druggable. 18 …”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The PPIs of TFs and TGs, on the other hand, were extracted from a integrated network of human gene interactions recently published by our group [11]. …”
Section: Construction and Contentmentioning
confidence: 99%