An Efficient Algorithm for Frequent Itemset Mining on Data Streams

Xie, Zhijun; Chen, Hong; Li, Cuiping

doi:10.1007/11790853_37

Cited by 17 publications

(10 citation statements)

References 10 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…On the other hand, the best case is when every transaction is the same, with the number of tail-nodes being one. Moreover, keeping the tid information in a tree structure has also been found in literature discussing the efficient mining of frequent patterns [5]- [7]. To a certain extent, some of those approaches additionally maintain a support count and/or the tid information [6], [7] in each tree node.…”

Section: Propertymentioning

confidence: 99%

See 1 more Smart Citation

Mining Regular Patterns in Transactional Databases

Tanbeer

Ahmed

Jeong

et al. 2008

IEICE Transactions on Information and Systems

View full text Add to dashboard Cite

SUMMARYThe frequency of a pattern may not be a sufficient criterion for identifying meaningful patterns in a database. The temporal regularity of a pattern can be another key criterion for assessing the importance of a pattern in several applications. A pattern can be said regular if it appears at a regular user-defined interval in the database. Even though there have been some efforts to discover periodic patterns in time-series and sequential data, none of the existing studies have provided an appropriate method for discovering the patterns that occur regularly in a transactional database. Therefore, in this paper, we introduce a novel concept of mining regular patterns from transactional databases. We also devise an efficient tree-based data structure, called a Regular Pattern tree (RP-tree in short), that captures the database contents in a highly compact manner and enables a pattern growth-based mining technique to generate the complete set of regular patterns in a database for a user-defined regularity threshold. Our performance study shows that mining regular patterns with an RP-tree is time and memory efficient, as well as highly scalable.

show abstract

Section: Propertymentioning

confidence: 99%

“…Mining patterns that appear frequently in transactional databases [1], [2], [7], [14] has been widely studied for over a decade. The rationale behind mining frequent patterns is that only patterns occurring at a high frequency are of interest to users.…”

Section: Introductionmentioning

confidence: 99%

Mining Regular Patterns in Transactional Databases

Tanbeer

Ahmed

Jeong

et al. 2008

IEICE Transactions on Information and Systems

View full text Add to dashboard Cite

show abstract

“…Most studies about finding frequent patterns in a data stream are based on the landmark window model [25,41,44] or the sliding window model [3,6,22,27,29,24]. The first attempt to mine frequent patterns over the entire history of streaming data was proposed by Manku and Motwani [28].…”

Section: Related Workmentioning

confidence: 99%

“…They developed two single-pass algorithms, Sticky-Sampling and Lossy Counting, both of which are based on the anti-monotone 1 property; these algorithms provide approximate results with an error bound. Zhi-Jun et al [44] used a lattice structure, referred to as a frequent enumerate tree, which is divided into several equivalent classes of stored patterns with the same transaction-ids in a single class. Frequent patterns are divided into equivalent classes, and only those frequent patterns that represent the two borders of each class are maintained; other frequent patterns are pruned.…”

Section: Related Workmentioning

confidence: 99%

Sliding window-based frequent pattern mining over data streams

Tanbeer

Ahmed

Jeong

et al. 2009

Information Sciences

135

View full text Add to dashboard Cite

“…Most of them are based on the landmark window model [17,29,34] and the sliding window model [14,16,18,25]. DSM-FI [17] is a landmark based algorithm.…”

Section: Related Workmentioning

confidence: 99%

EclatDS: An efficient sliding window based frequent pattern mining method for data streams

Deypir

Sadreddini

2011

IDA

View full text Add to dashboard Cite

Mining frequent patterns over data streams is an interesting problem due to its wide application area. The researchers in this field have been facing two key challenges, namely reduction in runtime and memory usage. In this study, a novel method for efficient mining of frequent patterns over data streams is proposed. The method is based on sliding window model which divides the window into a number of panes. This method provides a new sliding window mechanism by utilizing a set of simple short lists. Each list stores related information about an item in the sliding window. The proposed mechanism dynamically adopts itself with the concept change. This method is empirically evaluated against recently proposed pane based sliding window algorithms. Experimental results on synthetically generated and real life data streams show the superiority of the proposed method with multiple orders of magnitude in terms of runtime and memory usage with respect to other pane based sliding window algorithms.

show abstract

An Efficient Algorithm for Frequent Itemset Mining on Data Streams

Cited by 17 publications

References 10 publications

Mining Regular Patterns in Transactional Databases

Mining Regular Patterns in Transactional Databases

Sliding window-based frequent pattern mining over data streams

EclatDS: An efficient sliding window based frequent pattern mining method for data streams

Contact Info

Product

Resources

About