Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073) 2000
DOI: 10.1109/icde.2000.839382
|View full text |Cite
|
Sign up to set email alerts
|

Rules of thumb in data engineering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
71
0

Year Published

2005
2005
2017
2017

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 133 publications
(73 citation statements)
references
References 9 publications
2
71
0
Order By: Relevance
“…Because of high latency of disk, larger clusters are generally expected to yield better sort performance despite the limited fan-out in run generation and the increased number of merge steps. In fact, it is claimed that the optimal size of cluster has steadily increased roughly from 16 or 32 KBytes to 128 KBytes or even larger over the past decade, as the gap between latency and bandwidth improvement has become wider [7,9].…”
Section: External Sortmentioning
confidence: 99%
“…Because of high latency of disk, larger clusters are generally expected to yield better sort performance despite the limited fan-out in run generation and the increased number of merge steps. In fact, it is claimed that the optimal size of cluster has steadily increased roughly from 16 or 32 KBytes to 128 KBytes or even larger over the past decade, as the gap between latency and bandwidth improvement has become wider [7,9].…”
Section: External Sortmentioning
confidence: 99%
“…Data format transformation is needed because, in our case, CDR is not ideal to be directly processed by MapReduce. Based on the works [1,9], in fixed disk mode, sequential access has much better performance than random access. So in our case of using conventional disks, sequential pattern implementation will improve ingress performance.…”
Section: System Analysismentioning
confidence: 99%
“…Following Moore's law, the speed of hardware is likely to increase by almost a factor of two in less than two years. However, as processor speeds continue to outpace memory speeds [10], the gap between processor and memory performance increases by about 50 % per year [5]. Thus, there is a need to minimize the effects of such high performance hardware.…”
Section: Preliminariesmentioning
confidence: 99%