2009
DOI: 10.1145/1577129.1577133
|View full text |Cite
|
Sign up to set email alerts
|

How to simulate 1000 cores

Abstract: This paper proposes a novel methodology to efficiently simulate shared-memory multiprocessors composed of hundreds of cores. The basic idea is to use thread-level parallelism in the software system and translate it into corelevel parallelism in the simulated world. To achieve this, we first augment an existing full-system simulator to identify and separate the instruction streams belonging to the different software threads. Then, the simulator dynamically maps each instruction flow to the corresponding core of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0

Year Published

2010
2010
2017
2017

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 55 publications
(21 citation statements)
references
References 23 publications
0
21
0
Order By: Relevance
“…This is due to excessive synchronisation in the benchmarks beyond 64 cores, and the fact that our simulator can execute tight spin-lock loops at near native speed. It is a well-known fact that the SPLASH-2 benchmarks attract high synchronisation costs for large-scale hardware configurations, as shown by other research [12]. The MULTIBENCH results are not affected in the same way due to less synchronisation.…”
Section: E Scalabilitymentioning
confidence: 61%
“…This is due to excessive synchronisation in the benchmarks beyond 64 cores, and the fact that our simulator can execute tight spin-lock loops at near native speed. It is a well-known fact that the SPLASH-2 benchmarks attract high synchronisation costs for large-scale hardware configurations, as shown by other research [12]. The MULTIBENCH results are not affected in the same way due to less synchronisation.…”
Section: E Scalabilitymentioning
confidence: 61%
“…We employ the GEMS simulator [26] fed with memory accesses generated by PIN [25] as explained by Monchiero et al [27]. The interconnection network has been modeled with GARNET [5], included in the GEMS toolset.…”
Section: Simulation Methodologymentioning
confidence: 99%
“…To model many-core multiprocessors, we take an approach similar to [9,20] of building a hierarchical framework where cycle-accurate simulations of individual cores are combined by a top-level chip-wide simulator to model an entire many-core processor. Our framework is illustrated in Figure 5.…”
Section: Evaluation Methodology 41 Simulation Infrastructurementioning
confidence: 99%