Proceedings 2004 VLDB Conference 2004
DOI: 10.1016/b978-012088469-8.50057-7
|View full text |Cite
|
Sign up to set email alerts
|

Merging the Results of Approximate Match Operations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2005
2005
2021
2021

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 44 publications
(26 citation statements)
references
References 14 publications
0
26
0
Order By: Relevance
“…The superscript is a bit vector indicating the membership of tuples seen so far for that state. For example, s 10 1,2 = t 1 , ¬t 2 and s 01 1,2 = ¬t 1 , t 2 . Now, assume n is the number of all tuples, consider an example satisfying the following two conditions:…”
Section: Complexity Analysis On U-topk and U-kranks 761 U-topkmentioning
confidence: 99%
“…The superscript is a bit vector indicating the membership of tuples seen so far for that state. For example, s 10 1,2 = t 1 , ¬t 2 and s 01 1,2 = ¬t 1 , t 2 . Now, assume n is the number of all tuples, consider an example satisfying the following two conditions:…”
Section: Complexity Analysis On U-topk and U-kranks 761 U-topkmentioning
confidence: 99%
“…The use of supervised (training-based) approaches or learners aims at automating the process of entity matching to reduce the required manual effort. Training-based approaches, e.g., Naïve Bayes [49], logistic regression [46], Support Vector Machine (SVM) [11,43,49] or decision trees [63,29,49,53,54,56] have so far been used for some subtasks, e.g., determining suitable parameterizations for matchers or adjusting combination functions parameters (weights for matchers, offsets). However, training-based approaches require suitable training data and providing such data typically involves manual effort.…”
Section: Combination Of Matchersmentioning
confidence: 99%
“…A single match approach typically performs very differently for different domains and match problems. For example, it has been shown that there is no universally best string similarity measure [29,50]. Instead it is often beneficial and necessary to combine several methods for improved matching quality, e.g., to consider the similarity of several attributes or to take into account relationships between entities.…”
Section: Introductionmentioning
confidence: 99%
“…(2) Random accessing and ranking supports mainly random access over the dataset until the answers have been retrieved. [10] uses foot-rule distance to measure the two rankings and model the rank problem as the minimum cost perfect matching problem, whereas [5] proposes to translate the top-k query into a range query in database. (3) Pre-materialization and rank indices organizes the tuples in a special way, then applies similarity match for the answer of ranked query.…”
Section: Related Workmentioning
confidence: 99%