2017
DOI: 10.1007/s00224-017-9770-0
|View full text |Cite
|
Sign up to set email alerts
|

Document Spanners: From Expressive Power to Decision Problems

Abstract: We examine document spanners, a formal framework for information extraction that was introduced by Fagin, Vansummeren (PODS 2013, JACM 2015). A document spanner is a function that maps an input string to a relation over spans (intervals of positions of the string). We focus on document spanners that are defined by regex formulas, which are basically regular expressions that map matched subexpressions to corresponding spans, and on core spanners, which extend the former by standard algebraic operators and str… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
53
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 25 publications
(53 citation statements)
references
References 30 publications
0
53
0
Order By: Relevance
“…Proof. The upper bound is obvious (even for core spanners, see [16]). For the lower bound, we construct a reduction from 3CNF-satisfiability to the evaluation problem of Boolean regex CQs.…”
Section: Lower Boundsmentioning
confidence: 99%
See 3 more Smart Citations
“…Proof. The upper bound is obvious (even for core spanners, see [16]). For the lower bound, we construct a reduction from 3CNF-satisfiability to the evaluation problem of Boolean regex CQs.…”
Section: Lower Boundsmentioning
confidence: 99%
“…Theorem 5.1. [16] Evaluation of Boolean regex CQs with string equalities is NP-complete, even if restricted to queries of the form π ∅ ζ = x1,y1 · · · ζ = xm,ym α. In other words, even a single regex formula already leads to NP-hardness.…”
Section: Lower Boundmentioning
confidence: 99%
See 2 more Smart Citations
“…The main topic of this paper is a logic that captures core spanners. Freydenberger and Holldack [16] connected core spanners to EC reg , the existential theory of concatenation with regular constraints. Described very informally, EC reg is a logic that combines equations on words (like xaby = ybax) with positive logical connectives, and regular languages that constrain variable replacement.…”
Section: Introductionmentioning
confidence: 99%