2014 IEEE International Conference on Cluster Computing (CLUSTER) 2014
DOI: 10.1109/cluster.2014.6968737
|View full text |Cite
|
Sign up to set email alerts
|

POSTER: Boosting the performance of remote GPU virtualization using InfiniBand connect-IB and PCIe 3.0

Abstract: Abstract-A clear trend has emerged involving the acceleration of scientific applications by using GPUs. However, the capabilities of these devices are still generally underutilized. Remote GPU virtualization techniques can help increase GPU utilization rates, while reducing acquisition and maintenance costs. The overhead of using a remote GPU instead of a local one is introduced mainly by the difference in performance between the internode network and the intranode PCIe link. In this paper we show how using th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2015
2015
2017
2017

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 8 publications
0
4
0
Order By: Relevance
“…For example, the InfiniBand Verbs API may be used instead of the TCP/IP protocol stack, boosting network throughput. Also, an efficient communication pipeline could be leveraged, as in the rCUDA remote GPU virtualization framework [10]. Another possibility is using the GPU Direct RDMA mechanism provided by NVIDIA and Mellanox [17].…”
Section: Performance When Communications Are Improvedmentioning
confidence: 99%
See 2 more Smart Citations
“…For example, the InfiniBand Verbs API may be used instead of the TCP/IP protocol stack, boosting network throughput. Also, an efficient communication pipeline could be leveraged, as in the rCUDA remote GPU virtualization framework [10]. Another possibility is using the GPU Direct RDMA mechanism provided by NVIDIA and Mellanox [17].…”
Section: Performance When Communications Are Improvedmentioning
confidence: 99%
“…The performance estimation methodology consists in replacing, in the results presented in the previous section, the communication time between main memory in the client and the GPU memory in the server (including the intermediate stop at the server's main memory) by the time that an optimized communication layer would attain. Notice that for estimating the time required to move data to and from the remote server, which depends on the volume of input and output data and also on the network bandwidth attained for each transfer size, the bandwidth achieved by the rCUDA remote GPU virtualization framework [10] has been used instead of using the raw bandwidth of the network fabric. This approach is more accurate than using the raw InfiniBand bandwidth because software layers always impose some loss to theoretical performance numbers.…”
Section: Performance When Communications Are Improvedmentioning
confidence: 99%
See 1 more Smart Citation
“…GPU virtualization solutions to GPGPU as GVirtuS have been implemented in research projects as rCUDA (Remote CUDA) [33] e DS-CUDA (DistributedShared CUDA) [17]. They all use an approach similar to GVirtuS, providing CUDA API wrappers on the front-end application in the guest OS while the back-end in the host OS accesses to the CUDA devices.…”
Section: Related Workmentioning
confidence: 99%