Proceedings of the Conference on Empirical Methods in Natural Language Processing - EMNLP '08 2008
DOI: 10.3115/1613715.1613787
|View full text |Cite
|
Sign up to set email alerts
|

Weakly-supervised acquisition of labeled class instances using graph random walks

Abstract: We present a graph-based semi-supervised label propagation algorithm for acquiring opendomain labeled classes and their instances from a combination of unstructured and structured text sources. This acquisition method significantly improves coverage compared to a previous set of labeled classes and instances derived from free text, while achieving comparable precision.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
63
0

Year Published

2009
2009
2022
2022

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 80 publications
(65 citation statements)
references
References 10 publications
0
63
0
Order By: Relevance
“…We are less concerned with extraction performance, but focus on the accuracy of the learned type system by measuring how well it performs in a prediction task. Talukdar et al (2008) and Talukdar and Pereira (2010) present graph-based approaches to the similar problem of class-instance learning. While this provides a way to discover types, it requires a large graph that does not easily generalize to new instances (transductive), since it produces no predictive model.…”
Section: Related Workmentioning
confidence: 99%
“…We are less concerned with extraction performance, but focus on the accuracy of the learned type system by measuring how well it performs in a prediction task. Talukdar et al (2008) and Talukdar and Pereira (2010) present graph-based approaches to the similar problem of class-instance learning. While this provides a way to discover types, it requires a large graph that does not easily generalize to new instances (transductive), since it produces no predictive model.…”
Section: Related Workmentioning
confidence: 99%
“…Entity clustering from semi-structured data has been addressed previously [18,23,9]. These approaches however do not address the issue of mixed-membership.…”
Section: Related Workmentioning
confidence: 99%
“…Now, analyzing the objective in Equation 12 in the manner outlined in Section 4, we arrive at the update rule shown in Equation 13.…”
Section: Extensions: Non-mutually Exclusive Labelsmentioning
confidence: 99%
“…Most algorithms only output label information to the unlabeled data in a transductive setting, while some algorithms are designed for the semi-supervised framework and build a classification model which can be applied to out-of-sample examples. Adsorption [1] is one such recently proposed graph based semi-supervised algorithm which has been successfully used for different tasks, such as recommending YouTube videos to users [1] and large scale assignment of semantic classes to entities within Information Extraction [13]. Adsorption has many desirable properties: it can perform multiclass classification, it can be parallelized and hence can be scaled to handle large data sets which is of particular importance for semi-supervised algorithms.…”
Section: Introductionmentioning
confidence: 99%