2007
DOI: 10.1007/s10618-007-0083-9
|View full text |Cite
|
Sign up to set email alerts
|

Mining functional dependencies from data

Abstract: In this paper, we propose an efficient rule discovery algorithm, called FD_Mine, for mining functional dependencies from data. By exploiting Armstrong's Axioms for functional dependencies, we identify equivalences among attributes, which can be used to reduce both the size of the dataset and the number of functional dependencies to be checked. We first describe four effective pruning rules that reduce the size of the search space. In particular, the number of functional dependencies to be checked is reduced by… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
73
0

Year Published

2011
2011
2018
2018

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 69 publications
(73 citation statements)
references
References 20 publications
0
73
0
Order By: Relevance
“…A functional dependency states that values in one set of columns functionally determine the value of another column. Again, much research has been performed to automatically detect Fds [75,139]. Section 5 surveys dependency discovery algorithms in detail.…”
Section: Dependenciesmentioning
confidence: 99%
See 2 more Smart Citations
“…A functional dependency states that values in one set of columns functionally determine the value of another column. Again, much research has been performed to automatically detect Fds [75,139]. Section 5 surveys dependency discovery algorithms in detail.…”
Section: Dependenciesmentioning
confidence: 99%
“…TANE [75], FUN [110], and FD_Mine [139] are three algorithms that follow this strategy, with FUN and FD_Mine introducing additional pruning rules beyond TANE's based on the properties of Fds. They start with sets of single columns in the LHS and work their way up the powerset lattice in a level-wise manner.…”
Section: Column-based Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation
“…Another form of dependency, which is also relevant for data quality, is the functional dependency (FD). Again, much research has been performed to automatically detect FDs [26,45].…”
Section: State Of the Artmentioning
confidence: 99%
“…Yet they have been proved not to scale properly for big loads of data, which is the most common scenario in DW systems, as pointed out, for example, in (C. Monash, 2008;Golfarelli and Rizzi, 2009). For this reason, this is still a research topic giving rise to new proposals, like (Sismanis, Brown, Haas and Reinwald, 2006;Yao and Hamilton, 2008;Yeh, Li and Chu, 2008). Furthermore, note that these approaches exclusively work over instances and they cannot easily tolerate erroneous data (that may generate fake IDs that do not really hold or overlook real ones).…”
Section: Related Workmentioning
confidence: 99%