2007 IEEE 23rd International Conference on Data Engineering 2007
DOI: 10.1109/icde.2007.367895
|View full text |Cite
|
Sign up to set email alerts
|

Group Linkage

Abstract: Poor quality data is prevalent in databases due to a variety of reasons, including transcription errors, lack of standards for recording database fields, etc. To be able to query and integrate such data, considerable recent work has focused on the record linkage problem, i.e., determine if two entities represented as relational records are approximately the same. Often entities are represented as groups of relational records, rather than individual relational records, e.g., households in a census survey consis… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
45
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 55 publications
(45 citation statements)
references
References 20 publications
0
45
0
Order By: Relevance
“…Group linkage [19] extends the scenario to take groups of records into consideration. Group-wise similarity [19] is calculated based on record-level similarity. When exact matching is enforced at the record level, Jaccard similarity [15] is employed at the group level: similarity of two groups (R and S) is defined as: SIM (R, S) = |R ∩ S|/|R ∪ S|.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Group linkage [19] extends the scenario to take groups of records into consideration. Group-wise similarity [19] is calculated based on record-level similarity. When exact matching is enforced at the record level, Jaccard similarity [15] is employed at the group level: similarity of two groups (R and S) is defined as: SIM (R, S) = |R ∩ S|/|R ∪ S|.…”
Section: Related Workmentioning
confidence: 99%
“…linked) if and only if SIM (R i , S j ) ≥ θ, where SIM () is an arbitrary group similarity function and θ is a pre-negotiated threshold. In this paper, we follow the original group linkage definition [19] to use Jaccard similarity [15] as the group-level similarity measurement (see Section 3.2 for details). In real-world applications, the number of elements in groups is usually small (e.g.…”
Section: Problem Statementmentioning
confidence: 99%
See 3 more Smart Citations