2011 IEEE International Symposium on Workload Characterization (IISWC) 2011
DOI: 10.1109/iiswc.2011.6114176
|View full text |Cite
|
Sign up to set email alerts
|

On the memory system requirements of future scientific applications: Four case-studies

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 21 publications
(19 citation statements)
references
References 12 publications
0
19
0
Order By: Relevance
“…The hubs, introduced in Section III, are switches interfacing the intra-cluster networks with the intercluster one. E-link-intra 0.25/0.50 pJ/bit (2.5/5 mm) E-link-inter 15.5 pJ/bit (12.5 cm) E-switch-dyn (4 port) 1 81.79 pJ/flit (x16) P-switch-static (4 port) 47.96 mW (x16) E-switch-dyn (6 port) 2 125.69 pJ/flit (x64) P-switch-static (6 port) 73.05 mW (x64) E-hub-dyn (7 port) 3 91.39 pJ/flit (x4) P-hub-static (7 port) 51.39 mW 4 (x4)…”
Section: A Simulation and Evaluation Methodologymentioning
confidence: 99%
See 1 more Smart Citation
“…The hubs, introduced in Section III, are switches interfacing the intra-cluster networks with the intercluster one. E-link-intra 0.25/0.50 pJ/bit (2.5/5 mm) E-link-inter 15.5 pJ/bit (12.5 cm) E-switch-dyn (4 port) 1 81.79 pJ/flit (x16) P-switch-static (4 port) 47.96 mW (x16) E-switch-dyn (6 port) 2 125.69 pJ/flit (x64) P-switch-static (6 port) 73.05 mW (x64) E-hub-dyn (7 port) 3 91.39 pJ/flit (x4) P-hub-static (7 port) 51.39 mW 4 (x4)…”
Section: A Simulation and Evaluation Methodologymentioning
confidence: 99%
“…These systems are very demanding from the memory bandwidth point of view because of the aggregate requirements of the constantly increasing number of cores per chip [3], [4]. On top of that, latency for accessing the memory hierarchy is typically the key to application performance and in some cases even more important than available bandwidth [5].…”
Section: Introductionmentioning
confidence: 99%
“…Dynamic energy is the product of the energy for a load or store and the number of loads and stores for each level of the memory hierarchy for a level, as shown in equation (3). Static energy is estimated as product of time (T) and static power of the memory hierarchy illustrated in equation (4). The static power is calculated as sum of static power of each level of cache and the refresh power of DRAM/eDRAM in the memory hierarchy.…”
Section: )mentioning
confidence: 99%
“…the memory wall [1]) and this gap in the performance of CPU and memory sub-systems will only increase further for future many-core systems [2]. Second, there is a growing capacity gap due to the limited scaling of DRAM, power, and cost limitations [3,4]. Finally, at the current DDR3 powerperformance level of approximately 600 MW/GB/s, in an Exascale system, the memory alone would draw 600MW [2].…”
Section: Introductionmentioning
confidence: 99%
“…A recent study has shown that the increase of the number of cores on a single chip puts great stress on the off-chip memory memory system: when executed on a 128-core system, HPC applications require 64 GB of capacity and may require up to 64 GB/s of off-chip memory bandwidth [10]. It is clear that current memory systems will not be able to sustain these requirements when the number of cores begins to increase.…”
Section: Introductionmentioning
confidence: 99%