2020
DOI: 10.1145/3418896
|View full text |Cite
|
Sign up to set email alerts
|

An Overview of End-to-End Entity Resolution for Big Data

Abstract: One of the most critical tasks for improving data quality and increasing the reliability of data analytics is Entity Resolution (ER), which aims to identify different descriptions that refer to the same real-world entity. Despite several decades of research, ER remains a challenging problem. In this survey, we highlight the novel aspects of resolving Big Data entities when we should satisfy more than one of the Big Data characteristics simultaneously (i.e., Volume and Velocity with Vari… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
87
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 164 publications
(87 citation statements)
references
References 170 publications
0
87
0
Order By: Relevance
“…Blocking, which is surveyed by Christen [16], Papadakis et al [72,73], is considered an important subtask of entity matching, meant to tackle the quadratic complexity of potential matches. Christophides et al [17] specifically review entity matching techniques in the context of big data. There has been an uptick in interest in both machine learning and crowdsourcing as a solution to entity matching in recent years.…”
Section: Other Surveys and Extensive Overviewsmentioning
confidence: 99%
See 2 more Smart Citations
“…Blocking, which is surveyed by Christen [16], Papadakis et al [72,73], is considered an important subtask of entity matching, meant to tackle the quadratic complexity of potential matches. Christophides et al [17] specifically review entity matching techniques in the context of big data. There has been an uptick in interest in both machine learning and crowdsourcing as a solution to entity matching in recent years.…”
Section: Other Surveys and Extensive Overviewsmentioning
confidence: 99%
“…These steps can also be viewed as a chain of the subtasks or subproblems that make up entity matching. Inspired by processes and figures such as those in [15,17,24,36,66], Figure 2 depicts this reference model of the traditional entity matching process. We will use the model to frame the discussion of different methods using neural networks.…”
Section: The Entity Matching Processmentioning
confidence: 99%
See 1 more Smart Citation
“…Overviews of the main methods can be found in recent books [2,3,4,5], surveys [6,7,8] and tutorials [9,10,11,12].…”
Section: Introductionmentioning
confidence: 99%
“…See https://docs.docker.com/engine/install/debian for detailed instructions 8. See https://docs.docker.com/engine/install/fedora for detailed instructions 9.…”
mentioning
confidence: 99%