Proceedings of the 46th International Symposium on Computer Architecture 2019
DOI: 10.1145/3307650.3322230
|View full text |Cite
|
Sign up to set email alerts
|

MGPUSim

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 61 publications
(5 citation statements)
references
References 40 publications
0
5
0
Order By: Relevance
“…NVArchSim (NVAS) [44] is the proprietary hybrid trace-driven simulator used by Nvidia in which different levels of abstraction (detailed versus high-abstraction timing models) are deployed to balance simulation speed and accuracy. MGPUSim [41] is a parallel simulator for modeling multi-GPU systems.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…NVArchSim (NVAS) [44] is the proprietary hybrid trace-driven simulator used by Nvidia in which different levels of abstraction (detailed versus high-abstraction timing models) are deployed to balance simulation speed and accuracy. MGPUSim [41] is a parallel simulator for modeling multi-GPU systems.…”
Section: Related Workmentioning
confidence: 99%
“…Sampling, through which a limited number of representative regions are simulated, is a widely used methodology. While there exists a large body of work on sampled simulation for CPUs [16]- [20], [24], [38], [39], [51], sampling techniques specifically developed and tailored for speeding up GPU simulation have only recently received attention, see in particular [23], [25], [41], [44]. The stateof-the-art GPU workload sampling methodology, and most closely related work compared to ours, is Principal Kernel Selection (PKS) [11] which was shown to yield high accuracy and high speed for a variety of GPU-compute workloads.…”
Section: Introductionmentioning
confidence: 99%
“…The AMD GCN architecture [6] is related to OpenCL platform model. A GPU device consists of several compute units.…”
Section: The Amd Gcn Architecturementioning
confidence: 99%
“…We characterize a BFS application from the SHOC Benchmark Suit [11] on a real-world graph and collect the memory trace with a GPU simulator [58]. This BFS application uses CSR graph format and warp-centric execution, similar to Grus.…”
Section: Adaptive Um Policymentioning
confidence: 99%