2010
DOI: 10.5121/ijdms.2010.2403

Scaling Data Mining Algorithms to Large and Distributed Datasets

Abstract: In the contemporary world of global economy real-life data is

Cited by 9 publications (5 citation statements)
References 26 publications
“…In this case, different mining techniques are needed, and partitioning is one such technique. Most disk-based partitioning techniques [ 11 - 14 ] find frequent patterns from each partition and check to discover all frequent patterns. This approach, however, has some drawbacks, because frequent patterns may look infrequent due to local support pruning.…”
Section: Methods (mentioning)
confidence: 99%
“…Another aspect to consider is the size of real DNA sequence databases, which is ever increasing. For the cases where a DNA sequence database can not fit into the main memory, disk-based mining has been studied, based on partitioning [ 11 - 14 ]. Most of these techniques, however, only consider local frequency counting, although many frequent patterns may look infrequent due to local support pruning.…”
Section: Introduction (mentioning)
confidence: 99%
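
The partitioning idea described in the statements above can be illustrated with a minimal sketch. It is not the method of the cited papers [11 - 14]; it is a generic two-phase scheme, assuming a relative support threshold and brute-force counting of itemsets up to size two. The names `local_frequent`, `partitioned_mine`, `min_frac`, and `max_size` are hypothetical and chosen only for illustration. Phase 1 mines each partition on its own, and phase 2 recounts the union of local candidates over all partitions, which is what prevents a globally frequent pattern from being lost to local support pruning.

```python
from collections import Counter
from itertools import combinations


def local_frequent(partition, min_frac, max_size=2):
    """Return itemsets (sorted tuples, size <= max_size) whose support
    within this one partition is at least min_frac of its transactions."""
    counts = Counter()
    for txn in partition:
        items = sorted(set(txn))
        for k in range(1, max_size + 1):
            for combo in combinations(items, k):
                counts[combo] += 1
    threshold = min_frac * len(partition)
    return {iset for iset, c in counts.items() if c >= threshold}


def partitioned_mine(partitions, min_frac, max_size=2):
    """Two-phase sketch: collect local candidates, then recount globally."""
    # Phase 1: an itemset that is frequent overall must be frequent in at
    # least one partition, so the union of local results is a safe candidate set.
    candidates = set()
    for part in partitions:
        candidates |= local_frequent(part, min_frac, max_size)

    # Phase 2: count every candidate over all partitions and keep only those
    # that meet the same relative threshold on the whole dataset.
    total = sum(len(part) for part in partitions)
    counts = Counter()
    for part in partitions:
        for txn in part:
            items = set(txn)
            for cand in candidates:
                if items.issuperset(cand):
                    counts[cand] += 1
    return {cand: n for cand, n in counts.items() if n >= min_frac * total}


# Toy data (made up for illustration): two partitions of four transactions each.
partitions = [
    [["a", "b"], ["a", "b"], ["a", "c"], ["b", "c"]],
    [["a", "b"], ["c"], ["c"], ["c"]],
]
result = partitioned_mine(partitions, min_frac=0.5)
print(result)
```

In this toy dataset the item `a` occurs in only one of the four transactions in the second partition, so it looks infrequent there, yet it appears in four of the eight transactions overall and survives the global recount. A technique that relied on local counts alone would report it inconsistently across partitions.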
“…In addition to BI, meteorology, petroleum exploration, and bioinformatics are among the scientific fields where big data and data mining are gaining popularity. Software, hardware, and sophisticated algorithms are required to support this data sequence [21]. M.Jayasree et al have proposed the difficulty in locating rules of association between products in a large database of sales transactions.…”
Section: Related Work (mentioning)
confidence: 99%
“…The hierarchy tree documents are ranked using CSI ranking, and It creates another level depending upon the document fitting to the same shard as the previous document. Further, a shard rank is determined using the Lex-S approach [22,23].…”
Section: Connected Shire (Conn-s) (mentioning)
confidence: 99%