Proceedings of the 1st International Workshop on Utility-Based Data Mining - UBDM '05 2005
DOI: 10.1145/1089827.1089839
|View full text |Cite
|
Sign up to set email alerts
|

A fast high utility itemsets mining algorithm

Abstract: Association rule mining (ARM) identifies frequent itemsets from databases and generates association rules by considering each item in equal value. However, items are actually different in many aspects in a number of real applications, such as retail marketing, network log, etc. The difference between items makes a strong impact on the decision making in these applications. Therefore, traditional ARM cannot meet the demands arising from these applications. By considering the different values of individual items… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
239
0
6

Year Published

2008
2008
2020
2020

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 307 publications
(245 citation statements)
references
References 19 publications
0
239
0
6
Order By: Relevance
“…However, such a method is too time-consuming for a large dataset environment. Several heuristic methods have been proposed to accelerate the discovery of high utility (or SH-frequent) itemsets, such as the MEU (UMining_H) [27,28,34,35], SIP, CAC, and IAB [4,6] methods. Nevertheless, these predictive methods may not discover some high utility itemsets.…”
Section: Tidmentioning
confidence: 99%
See 3 more Smart Citations
“…However, such a method is too time-consuming for a large dataset environment. Several heuristic methods have been proposed to accelerate the discovery of high utility (or SH-frequent) itemsets, such as the MEU (UMining_H) [27,28,34,35], SIP, CAC, and IAB [4,6] methods. Nevertheless, these predictive methods may not discover some high utility itemsets.…”
Section: Tidmentioning
confidence: 99%
“…Recently, Li et al first developed some efficient approaches, including the FSM, SuFSM, ShFSM, and DCG methods, to identify all SH-frequent itemsets [22][23][24]. In the meanwhile, Liu et al also presented a Two-Phase (TP) method to discover all high utility itemsets [27,28].…”
Section: Tidmentioning
confidence: 99%
See 2 more Smart Citations
“…Therefore, the weight value of each item was heuristically chosen to be between 0.1 and 0.9, and randomly generated using a log-normal distribution. Some other pattern mining research [26], [27] has adopted the same technique. Figure 4 shows the weight distribution of 2000 distinct items using the log-normal distribution.…”
Section: Synthetic Datasets With Synthetic Weightsmentioning
confidence: 99%