2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2019
DOI: 10.1109/ipdps.2019.00035
|View full text |Cite
|
Sign up to set email alerts
|

Slate: Enabling Workload-Aware Efficient Multiprocessing for Modern GPGPUs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(7 citation statements)
references
References 21 publications
0
7
0
Order By: Relevance
“…Several studies focused on software mechanisms to improve the efficiency of multi-processing on GPUs. T. Allen et al proposed a framework called Slate that optimizes the combination of co-located processes and dynamically adjusts the scales of them [27]. smCompactor is a similar framework to Slate, which aims at maximizing the resource utilization [28].…”
Section: Related Workmentioning
confidence: 99%
“…Several studies focused on software mechanisms to improve the efficiency of multi-processing on GPUs. T. Allen et al proposed a framework called Slate that optimizes the combination of co-located processes and dynamically adjusts the scales of them [27]. smCompactor is a similar framework to Slate, which aims at maximizing the resource utilization [28].…”
Section: Related Workmentioning
confidence: 99%
“…Allen et al [5] presented Slate, a software-based workloadaware GPGPU multitasking framework. Similar to Maestro, Slate selects concurrent workloads that have complementary resource demands at run-time to minimize interference for individual workloads and improve resource utilization.…”
Section: Related Workmentioning
confidence: 99%
“…Multitasking of heterogeneous workloads has not received much attention from traditional GPU management as GPUs are generally adopted in systems dedicated to specific workloads. However, due to the widespread adoption of cloud systems, heterogeneous workloads are concurrently executed within a GPGPU device, and thus maximizing resource utilization by multitasking in GPGPU has become an important issue [4,5,6]. As shown in Figure 1, modern cloud systems are equipped with GPGPU devices along with traditional host resources (CPU, memory, storage, etc.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we tackle a more complex problem as it includes not only the task to partition allocation, but as well defining the partition size. Therefore, we investigate tractable heuristics aiming to explore the space of possible solutions with a reasonable complexity (pseudo polynomial complexity of θ(M • N • H) 2 . We highlight that we may omit the task index when it is not necessary.…”
Section: Heuristicsmentioning
confidence: 99%
“…The authors of [2] propose a software based kernel scheduler, Slate. Slate finds complementary resource demands to co-schedule kernels while minimizing the interference between concurrent kernels.…”
Section: Related Workmentioning
confidence: 99%