2021
DOI: 10.1145/3442200
|View full text |Cite
|
Sign up to set email alerts
|

Neural Networks for Entity Matching: A Survey

Abstract: Entity matching is the problem of identifying which records refer to the same real-world entity. It has been actively researched for decades, and a variety of different approaches have been developed. Even today, it remains a challenging problem, and there is still generous room for improvement. In recent years, we have seen new methods based upon deep learning techniques for natural language processing emerge. In this survey, we present how neural networks have been used for entity matching. Specifi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
24
0
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 68 publications
(35 citation statements)
references
References 81 publications
0
24
0
2
Order By: Relevance
“…Both propose a solution to challenge (2), but as we discussed above, with important limitations. Neither addresses challenge (3). Moreover, they do not provide a unified and coherent way of actually communicating or visualizing an explanation to the end-user in the same way the original authors of LIME do -something we aim to do.…”
Section: Related Workmentioning
confidence: 96%
See 1 more Smart Citation
“…Both propose a solution to challenge (2), but as we discussed above, with important limitations. Neither addresses challenge (3). Moreover, they do not provide a unified and coherent way of actually communicating or visualizing an explanation to the end-user in the same way the original authors of LIME do -something we aim to do.…”
Section: Related Workmentioning
confidence: 96%
“…While early work focused on custom architectures and trained models from scratch, the current state of the art focuses on fine-tuning large natural language models such as BERT [7], which offers higher accuracy and decreases the need for training examples [18]. We refer to Barlaug and Gulla [3] for an extensive survey on deep learning for EM.…”
Section: Related Workmentioning
confidence: 99%
“…In MinHash LSH, we configured the number of bands and rows, restricting both of them to powers of two such that their product is also a power of two, i.e., 2 𝑛 with 𝑛 ∈ {7, 8, 9}. For 𝑘-shingles, we considered four common values for 𝑘, i.e., [2,5].…”
Section: Qualitative Analysismentioning
confidence: 99%
“…All experiments were run on Java 15. For MH-LSH, we used java-LSH, version 0.12 5 . For HP-and CP-LSH, we used the Python wrapper of FALCONN [1], version 1.3.1 6 .…”
Section: Quantitative Analysismentioning
confidence: 99%
See 1 more Smart Citation