2006
DOI: 10.1007/11790853_37
|View full text |Cite
|
Sign up to set email alerts
|

An Efficient Algorithm for Frequent Itemset Mining on Data Streams

Abstract: In order to mining frequent itemsets on data stream efficiently, a new approach was proposed in this paper. The memory efficient and accurate one-pass algorithm divides all the frequent itemsets into frequent equivalence classes and prune all the redundant itemsets except for those represent the GLB(Greatest Lower Bound) and LUB(Least Upper Bound) of the frequent equivalence class and the number of GLB and LUB is much less than that of frequent itemsets. In order to maintain these equivalence classes, A compac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2006
2006
2015
2015

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 10 publications
0
10
0
Order By: Relevance
“…On the other hand, the best case is when every transaction is the same, with the number of tail-nodes being one. Moreover, keeping the tid information in a tree structure has also been found in literature discussing the efficient mining of frequent patterns [5]- [7]. To a certain extent, some of those approaches additionally maintain a support count and/or the tid information [6], [7] in each tree node.…”
Section: Propertymentioning
confidence: 99%
See 1 more Smart Citation
“…On the other hand, the best case is when every transaction is the same, with the number of tail-nodes being one. Moreover, keeping the tid information in a tree structure has also been found in literature discussing the efficient mining of frequent patterns [5]- [7]. To a certain extent, some of those approaches additionally maintain a support count and/or the tid information [6], [7] in each tree node.…”
Section: Propertymentioning
confidence: 99%
“…Mining patterns that appear frequently in transactional databases [1], [2], [7], [14] has been widely studied for over a decade. The rationale behind mining frequent patterns is that only patterns occurring at a high frequency are of interest to users.…”
Section: Introductionmentioning
confidence: 99%
“…Most studies about finding frequent patterns in a data stream are based on the landmark window model [25,41,44] or the sliding window model [3,6,22,27,29,24]. The first attempt to mine frequent patterns over the entire history of streaming data was proposed by Manku and Motwani [28].…”
Section: Related Workmentioning
confidence: 99%
“…They developed two single-pass algorithms, Sticky-Sampling and Lossy Counting, both of which are based on the anti-monotone 1 property; these algorithms provide approximate results with an error bound. Zhi-Jun et al [44] used a lattice structure, referred to as a frequent enumerate tree, which is divided into several equivalent classes of stored patterns with the same transaction-ids in a single class. Frequent patterns are divided into equivalent classes, and only those frequent patterns that represent the two borders of each class are maintained; other frequent patterns are pruned.…”
Section: Related Workmentioning
confidence: 99%
“…Most of them are based on the landmark window model [17,29,34] and the sliding window model [14,16,18,25]. DSM-FI [17] is a landmark based algorithm.…”
Section: Related Workmentioning
confidence: 99%