2021
DOI: 10.1093/nargab/lqab113
|View full text |Cite
|
Sign up to set email alerts
|

Supervised learning with word embeddings derived from PubMed captures latent knowledge about protein kinases and cancer

Abstract: Inhibiting protein kinases (PKs) that cause cancers has been an important topic in cancer therapy for years. So far, almost 8% of >530 PKs have been targeted by FDA-approved medications, and around 150 protein kinase inhibitors (PKIs) have been tested in clinical trials. We present an approach based on natural language processing and machine learning to investigate the relations between PKs and cancers, predicting PKs whose inhibition would be efficacious to treat a certain cancer. Our approach represen… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 49 publications
0
3
0
Order By: Relevance
“…This first implementation returns data from Ravenmehr et al. ( 49 ), which predicts a relationship between specific cancers and kinases. Incorporating data to Pharos allows scientists to have access to all the visualizations and subset analysis tools that Pharos offers, for their own datasets.…”
Section: Discussionmentioning
confidence: 99%
“…This first implementation returns data from Ravenmehr et al. ( 49 ), which predicts a relationship between specific cancers and kinases. Incorporating data to Pharos allows scientists to have access to all the visualizations and subset analysis tools that Pharos offers, for their own datasets.…”
Section: Discussionmentioning
confidence: 99%
“…The corpus used to test our proposal is composed of 10,584,195 abstracts and titles published between January, 2010 and November, 2020 and available in PubMed; they were downloaded from the FTP site of the National Center for Biotechnology Information (NCBI) by using Marea, a software 13 1 that automatically parses the annual baseline and daily file-updates, p rovided i n t he f orm o f m etadata by N CBI, a nd extract the PubMed ID and year of publication of each paper.…”
Section: Methodsmentioning
confidence: 99%
“…Despite these efforts, nearly 164 out of the 534 known kinases remain relatively understudied and are referred to as “dark” kinases by the NIH Illuminating the Druggable Genome consortium (IDG) ( Berginski et al, 2021 ; Gillespie et al, 2022 ; Kelleher et al, 2023 ). Characterizing the functions of these dark kinases is crucial because they work in conjunction with other well-studied kinases in signaling pathways and are also frequently mutated, or abnormally expressed, in human diseases such as cancers ( Berginski et al, 2021 ; Brognard & Hunter, 2011 ; Collins et al, 2018 ; Ravanmehr et al, 2021 ; Soleymani et al, 2022 ). While considerable progress has been made in illuminating the functions of several dark kinases, placing these kinases in a pathway or a cell signaling network context remains a major bioinformatics challenge.…”
Section: Introductionmentioning
confidence: 99%