2009
DOI: 10.1007/978-3-642-04031-3_24
|View full text |Cite
|
Sign up to set email alerts
|

Semi-supervised Prediction of Protein Interaction Sentences Exploiting Semantically Encoded Metrics

Abstract: Abstract. Protein-protein interaction (PPI) identification is an integral component of many biomedical research and database curation tools. Automation of this task through classification is one of the key goals of text mining (TM). However, labelled PPI corpora required to train classifiers are generally small. In order to overcome this sparsity in the training data, we propose a novel method of integrating corpora that do not contain relevance judgements. Our approach uses a semantic language model to gather… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2011
2011
2011
2011

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 21 publications
0
1
0
Order By: Relevance
“…Thus, to study the novelty of this approach we eliminate, as much as possible, reliance on further language processing algorithms. Consequently, by examining a simpler task, we produce results that are not directly comparable with the kernel-based PPI extraction methods described in [ 26 ], but are comparable to the baseline results described in [ 34 , 37 ]. However, the approach described here is modular, and can be augmented for use with methods that rely on deep linguistic features.…”
Section: Introductionmentioning
confidence: 88%
“…Thus, to study the novelty of this approach we eliminate, as much as possible, reliance on further language processing algorithms. Consequently, by examining a simpler task, we produce results that are not directly comparable with the kernel-based PPI extraction methods described in [ 26 ], but are comparable to the baseline results described in [ 34 , 37 ]. However, the approach described here is modular, and can be augmented for use with methods that rely on deep linguistic features.…”
Section: Introductionmentioning
confidence: 88%