2016
DOI: 10.1007/978-3-662-54037-4_2
|View full text |Cite
|
Sign up to set email alerts
|

Pay-as-you-go Configuration of Entity Resolution

Abstract: Abstract. Entity resolution, which seeks to identify records that represent the same entity, is an important step in many data integration and data cleaning applications. However, entity resolution is challenging both in terms of scalability (all-against-all comparisons are computationally impractical) and result quality (syntactic evidence on record equivalence is often equivocal). As a result, end-to-end entity resolution proposals involve several stages, including blocking to efficiently identify candidate … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
14
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 14 publications
(14 citation statements)
references
References 30 publications
0
14
0
Order By: Relevance
“…The entity resolution problem has been referred in the literature with multiple terms including deduplication, entity linkage, and entity matching [4,15]. Entity resolution has been used in various fields such as matching profiles in social networks [2], bioinformatics data [3], biomedical data [16], publication data [4,5], genealogical data [6], product data [4,5], etc. The attributes of the entities are compared, and a similarity value is assigned.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The entity resolution problem has been referred in the literature with multiple terms including deduplication, entity linkage, and entity matching [4,15]. Entity resolution has been used in various fields such as matching profiles in social networks [2], bioinformatics data [3], biomedical data [16], publication data [4,5], genealogical data [6], product data [4,5], etc. The attributes of the entities are compared, and a similarity value is assigned.…”
Section: Related Workmentioning
confidence: 99%
“…Other approaches decide the include the uncertainty of a match into the decision [19]. Finally, matching the entities can also be based on the feedback of an oracle [4,5] or of a user [5].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In pay-as-you-go data preparation, following from the vision of dataspaces [12], a best-effort integration is produced through automated bootstrapping, which is refined in the light of user feedback. There have been many proposals for pay-as-you-go data preparation components, for example relating to data extraction [10], matching [18,36], mapping [5,6,30] and entity resolution [14,25]. Such proposals have addressed issues such as targeting the most appropriate feedback [10,28,35], and accommodating unreliable feedback, in particular in the context of crowdsourcing [9,23].…”
Section: Related Workmentioning
confidence: 99%
“…This may be the closest work to ours, though the focus is on a single data integration step, for which custom feedback has been obtained. Also for entity resolution, feedback on matching pairs is used in [25] as part of a single optimisation step that configures together blocking, detailed comparison and clustering. This contrasts with the current paper in focusing different aspects of the same integration step.…”
Section: Related Workmentioning
confidence: 99%