2017
DOI: 10.1109/tpds.2016.2630697
|View full text |Cite
|
Sign up to set email alerts
|

Real-Time GPU Resource Management with Loadable Kernel Modules

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 31 publications
0
4
0
Order By: Relevance
“…For discrete CPU-GPU systems the point of interference is reduced to data movements between the discrete CPU and GPU memories over the PCIe bus. In this context, several approaches have been proposed [35] [36] [37]. Concerning integrated GPUs, the point of interference is not limited to a single peripheral bus, as the memory hierarchy itself is largely shared between the two devices.…”
Section: Related Workmentioning
confidence: 99%
“…For discrete CPU-GPU systems the point of interference is reduced to data movements between the discrete CPU and GPU memories over the PCIe bus. In this context, several approaches have been proposed [35] [36] [37]. Concerning integrated GPUs, the point of interference is not limited to a single peripheral bus, as the memory hierarchy itself is largely shared between the two devices.…”
Section: Related Workmentioning
confidence: 99%
“…When scanning IP pairs, SRLA only sets some bits of SEA to 1 or sets some DR to 0 and the results are the same regardless of the sequence of execution. Therefore, large amount of IP pairs can be processed simultaneously by updating SEA and DR via multiple threads [37].…”
Section: A Deploy On Gpumentioning
confidence: 99%
“…Some familiar metrics are IPC (Instructions Per Cycle) and cache_hit_rate. There are several metrics in CUPTI that reflect resource utilization, e.g., dram_utilization, L2_utilization, (11) scan (18) matrixMul (20) mergeSort (14) histogram (15) sortingNetworks (18) vectorAdd (18) transpose (19) Rodinia Pathfinder (10) bfs (16) backprop (8) nn (12) b+ (4) srad (4) hotspot (12) kmeans (20) alu_fu_utilization, or metrics that correspond to the size of data transferred among the resources of a GPU, such as local_load_transactions, and dram_write_transactions.…”
Section: Kernel Metricsmentioning
confidence: 99%
“…Therefore, one of the main targets of the proposed method is the scheduling layer of such systems. There already exist different scheduling methods, such as [9][10][11]. However, they rely on modifications in hardware specifications [12] while our proposed method is applied at software level.…”
Section: Introductionmentioning
confidence: 99%