Proceedings of the 25th ACM International on Conference on Information and Knowledge Management 2016
DOI: 10.1145/2983323.2983781
|View full text |Cite
|
Sign up to set email alerts
|

Approximate Discovery of Functional Dependencies for Large Datasets

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
17
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 19 publications
(17 citation statements)
references
References 13 publications
0
17
0
Order By: Relevance
“…For example, there is such a functional dependency A->B, which means that any two records in the relationship, when their values on the attribute set A are equal, the values on the attribute set B must be equal. Bleifuß et al [66] propose an approximate discovery strategy AID-FD (Approximate Iterative Discovery of FDs) which sacrifices a certain correct rate in exchange for performance improvement. AID-FD uses an incremental, focused sampling of tuple pairs to deduce non-FDs until user-configured termination criterion is met.…”
Section: B Sampling For Functional Dependenciesmentioning
confidence: 99%
“…For example, there is such a functional dependency A->B, which means that any two records in the relationship, when their values on the attribute set A are equal, the values on the attribute set B must be equal. Bleifuß et al [66] propose an approximate discovery strategy AID-FD (Approximate Iterative Discovery of FDs) which sacrifices a certain correct rate in exchange for performance improvement. AID-FD uses an incremental, focused sampling of tuple pairs to deduce non-FDs until user-configured termination criterion is met.…”
Section: B Sampling For Functional Dependenciesmentioning
confidence: 99%
“…Pyro focuses on approximate dependencies that may be violated by a certain portion of tuples or tuple pairs. Note that this is different from dependency approximation algorithms [8,22], which trade correctness guarantees of the discovered dependencies for performance improvements. In the following, we focus on those works that share goals or have technical commonalities with Pyro.…”
Section: Related Workmentioning
confidence: 95%
“…There has been a number of different formalisms to extend FDs to handle erroneous data inherent in real-world applications. Common approaches include approximate FDs [5] and conditional FDs [20,21] whereby a FD holds on a subset of data instead of the entire dataset (please refer to [7] for a detail survey of relaxed definitions of FDs). There has been some work on probabilistic FDs that might hold on the data with some probability [10,34].…”
Section: Related Workmentioning
confidence: 99%