2022 3rd International Conference on Artificial Intelligence and Data Sciences (AiDAS) 2022
DOI: 10.1109/aidas56890.2022.9918724
|View full text |Cite
|
Sign up to set email alerts
|

Duplicate Detection Using Unsupervised Random Forests: A Preliminary Analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…We started off with these two datasets in this study since they contain fields of mixed data types (numbers and strings). Based on the initial study that we performed in [32], any number values were set as integer data types and strings as characters. Both datasets have a one-to-one relationship, which means that duplicate records have only one match.…”
Section: Datasetmentioning
confidence: 99%
“…We started off with these two datasets in this study since they contain fields of mixed data types (numbers and strings). Based on the initial study that we performed in [32], any number values were set as integer data types and strings as characters. Both datasets have a one-to-one relationship, which means that duplicate records have only one match.…”
Section: Datasetmentioning
confidence: 99%
“…The field of duplication and its treatment methods is immense in the literature. It is addressed from different perspectives and domains, started from an overview of the duplication and techniques [21]- [23], improving the detection techniques [24], [25], evaluating the impact of the duplicates [26], and proposing new frameworks [27], [28] and methods [29], [30] to effectively enhance the detection process. However, little research addresses the impact of duplicates on the analysis results.…”
Section: Related Workmentioning
confidence: 99%