2012
DOI: 10.1109/tcbb.2012.50
|View full text |Cite
|
Sign up to set email alerts
|

Hash Subgraph Pairwise Kernel for Protein-Protein Interaction Extraction

Abstract: Extracting protein-protein interaction (PPI) from biomedical literature is an important task in biomedical text mining (BioTM). In this paper, we propose a hash subgraph pairwise (HSP) kernel-based approach for this task. The key to the novel kernel is to use the hierarchical hash labels to express the structural information of subgraphs in a linear time. We apply the graph kernel to compute dependency graphs representing the sentence structure for protein-protein interaction extraction task, which can efficie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(6 citation statements)
references
References 20 publications
0
6
0
Order By: Relevance
“…Also, by including complementary gene-disease association data resources, we anticipate to increase the prediction accuracy in future research. We expect to extend our work in several related research topics, including (1) integration of additional supervised information (e.g., key words for PubMed abstracts) to make LDA generate more controllable and interpretable topics [ 62 – 64 ]; (2) integration of more comprehensive association databases among disease, drug, and gene (e.g., HPRD [ 65 ] and DrugBank [ 66 ]) to construct more complete base association networks; (3) a framework to automatically extract such disease-specific association network so that such analysis can be extended to each disease topic; (4) additional network-based investigation of the relationships among disease, drug, and gene at other network levels such as module subnetwork identification; and (5) investigation on possible ways to improve the network by assigning weights or confidence values to different types of associations or associations from different sources.…”
Section: Discussionmentioning
confidence: 99%
“…Also, by including complementary gene-disease association data resources, we anticipate to increase the prediction accuracy in future research. We expect to extend our work in several related research topics, including (1) integration of additional supervised information (e.g., key words for PubMed abstracts) to make LDA generate more controllable and interpretable topics [ 62 – 64 ]; (2) integration of more comprehensive association databases among disease, drug, and gene (e.g., HPRD [ 65 ] and DrugBank [ 66 ]) to construct more complete base association networks; (3) a framework to automatically extract such disease-specific association network so that such analysis can be extended to each disease topic; (4) additional network-based investigation of the relationships among disease, drug, and gene at other network levels such as module subnetwork identification; and (5) investigation on possible ways to improve the network by assigning weights or confidence values to different types of associations or associations from different sources.…”
Section: Discussionmentioning
confidence: 99%
“…The biomedical literature contains lots of potentially valuable PPI data, and extraction of this data is an important research topic in the field of biomedical natural language processing [ 15 , 16 ].…”
Section: Methodsmentioning
confidence: 99%
“…Instead, publicly accessible annotated PPI corpora such as GENIA [ 18 ] and AImed [ 19 ] allow automatic extraction of PPI data using machine learning methods. Recent studies [ 15 , 16 ] have established the power of machine learning methods, which handle PPI extraction as a classification problem. The major challenge is in supplying the learner with the semantic/syntactic information-containing features in order to distinguish between interactions and non-interactions.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations