Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data 2009
DOI: 10.1145/1559845.1559858
|View full text |Cite
|
Sign up to set email alerts
|

Uncertainty management in rule-based information extraction systems

Abstract: Rule-based information extraction is a process by which structured objects are extracted from text based on userdefined rules. The compositional nature of rule-based information extraction also allows rules to be expressed over previously extracted objects. Such extraction is inherently uncertain, due to the varying precision associated with the rules used in a specific extraction task. Quantifying this uncertainty is crucial for querying the extracted objects in probabilistic databases, and for improving the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
23
0

Year Published

2011
2011
2018
2018

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 20 publications
(23 citation statements)
references
References 31 publications
0
23
0
Order By: Relevance
“…e.g. data collected from sensor networks [15], information extraction from the web [16,17], data integration [18,19], data cleaning [20][21][22][23][24][25], social networks [26,27], radio frequency identification RFID [7]. Due to various reasons that differ from one application to another, the uncertainty is inherent in such applications.…”
Section: Related Workmentioning
confidence: 99%
“…e.g. data collected from sensor networks [15], information extraction from the web [16,17], data integration [18,19], data cleaning [20][21][22][23][24][25], social networks [26,27], radio frequency identification RFID [7]. Due to various reasons that differ from one application to another, the uncertainty is inherent in such applications.…”
Section: Related Workmentioning
confidence: 99%
“…al. [30] look at the combination of the annotation rules adopted by each annotator. As in MEMM, it makes use of a maximal entropy classifier, but with "evidence vectors" being the output of rule-based annotators.…”
Section: Related Workmentioning
confidence: 99%
“…The earlier efforts in declarative IE [1,2,3] lack a unified framework supporting both a declarative interface as well as the state-of-the-art probabilistic IE models. Ways to handle uncertainties in IE have been considered in [14,15]. A probabilistic declarative IE system has been proposed in [4,13], but it only supports the Viterbi algorithm, which is unable to handle complex models that arise naturally from advanced features and relational operators.…”
Section: Related Workmentioning
confidence: 99%