2017
DOI: 10.1016/j.is.2016.09.001
|View full text |Cite
|
Sign up to set email alerts
|

TopPI: An efficient algorithm for item-centric mining

Abstract: International audienceIn this paper, we introduce item-centric mining, a new semantics for mining long-tailed datasets. Our algorithm, TopPI, finds for each item its top-k most frequent closed itemsets. While most mining algorithms focus on the globally most frequent itemsets, TopPI guarantees that each item is represented in the results, regardless of its frequency in the database. TopPI allows users to efficiently explore Web data, answering questions such as " what are the k most common sets of songs downlo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 9 publications
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…Each scenario requires to ingest and prepare data as a set of transactions. The transactions are fed to j LCM [16], our open-source parallel and distributed pattern mining algorithm that runs on MapReduce [13], to compute association rules. To cope with the skewed distribution of our transactions, j LCM is parameterizable and is used to mine per-item top-k itemsets.…”
Section: Empowering Domain Expertsmentioning
confidence: 99%
See 1 more Smart Citation
“…Each scenario requires to ingest and prepare data as a set of transactions. The transactions are fed to j LCM [16], our open-source parallel and distributed pattern mining algorithm that runs on MapReduce [13], to compute association rules. To cope with the skewed distribution of our transactions, j LCM is parameterizable and is used to mine per-item top-k itemsets.…”
Section: Empowering Domain Expertsmentioning
confidence: 99%
“…Extracting itemsets using jLCM Generating association rules, presented in Section 2.2, requires to first extract frequent itemsets from T . We use j LCM [16], our open-source parallel and distributed pattern mining algorithm that runs on MapReduce [13]. Mining frequent itemsets is done in two steps.…”
Section: Miningmentioning
confidence: 99%