2017
DOI: 10.1007/978-3-319-70407-4_30
|View full text |Cite
|
Sign up to set email alerts
|

JedAI: The Force Behind Entity Resolution

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 3 publications
0
5
0
Order By: Relevance
“…In this paper, we propose the NA-BLOCKER technique, which is capable of tolerating noisy data, extracting information regarding the schema of the data sources, generating groups of similar attributes, and pruning the generated blocking results in order to enhance the quality of the final blocks. Since Web approaches need to deal with data sources that present noisy and heterogeneous data, the proposed technique can be useful for these approaches, such as LIMES [20], LOV [28] and JedAI [24]. Based on the experimental results, we can highlight that NA-BLOCKER presents better results regarding effectiveness and aggregate cardinality than the state-of-the-art technique.…”
Section: Discussionmentioning
confidence: 97%
“…In this paper, we propose the NA-BLOCKER technique, which is capable of tolerating noisy data, extracting information regarding the schema of the data sources, generating groups of similar attributes, and pruning the generated blocking results in order to enhance the quality of the final blocks. Since Web approaches need to deal with data sources that present noisy and heterogeneous data, the proposed technique can be useful for these approaches, such as LIMES [20], LOV [28] and JedAI [24]. Based on the experimental results, we can highlight that NA-BLOCKER presents better results regarding effectiveness and aggregate cardinality than the state-of-the-art technique.…”
Section: Discussionmentioning
confidence: 97%
“…Entity resolution, also known as record deduplication and entity matching, aims to identify records referencing the same real-world entity in one or multiple datasets [1,6]. ER solutions address blocking problems [7][8][9][10][11], rule matching problems [12][13][14][15], crowdsourcing [16][17][18], or machine learning problems [3][4][5]19]. Good overviews of ER can be found in surveys such as [20,21].…”
Section: Related Workmentioning
confidence: 99%
“…The pyramid-like structure consists of several hidden layers, with the bottom layer having the most hidden units, and each subsequent layer having half the number of hidden units as the previous layer. Specifically, the vector representations for sentence s and sentence l are defined as shown in (8).…”
Section: [ ] [ ] [ ] Cls Att Val Att Val Sep Att Val Att Val Sepmentioning
confidence: 99%
“…Note that JedAI has already been presented in a short journal paper [48] and as a demo in past conferences [12,84,85]. The first releases, i.e., version 1 [84], version 2 [12] and version 2.1 [48], cover exclusively the serialized execution of the budget-and schema-agnostic workflow that is presented in Section 4.1, while providing a rather limited experimental analysis of its performance [48]. The serialized implementation of the batch schema-based workflow and of the budget-and schemaagnostic workflow are briefly presented in [85], without evaluating their relative performance.…”
Section: Blockingmentioning
confidence: 99%