POSTER: Boosting the performance of remote GPU virtualization using InfiniBand connect-IB and PCIe 3.0

Reaño, Carlos; Silla, Federico; Peña, Antonio J.; Shainer, Gilad; Schultz, Scot; Castelló, Adrián; Quintana‐Ortí, Enrique S.; Duato, J.

doi:10.1109/cluster.2014.6968737

Cited by 4 publications

(4 citation statements)

References 8 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For example, the InfiniBand Verbs API may be used instead of the TCP/IP protocol stack, boosting network throughput. Also, an efficient communication pipeline could be leveraged, as in the rCUDA remote GPU virtualization framework [10]. Another possibility is using the GPU Direct RDMA mechanism provided by NVIDIA and Mellanox [17].…”

Section: Performance When Communications Are Improvedmentioning

confidence: 99%

“…The performance estimation methodology consists in replacing, in the results presented in the previous section, the communication time between main memory in the client and the GPU memory in the server (including the intermediate stop at the server's main memory) by the time that an optimized communication layer would attain. Notice that for estimating the time required to move data to and from the remote server, which depends on the volume of input and output data and also on the network bandwidth attained for each transfer size, the bandwidth achieved by the rCUDA remote GPU virtualization framework [10] has been used instead of using the raw bandwidth of the network fabric. This approach is more accurate than using the raw InfiniBand bandwidth because software layers always impose some loss to theoretical performance numbers.…”

Section: Performance When Communications Are Improvedmentioning

confidence: 99%

“…This is the case of networked disks, which allow a file system to be shared among many different computers. Likewise, it is possible to provide GPU-acceleration services to a cluster by sharing a networked GPU by means of the remote GPU virtualization technique, which has been implemented in frameworks such as rCUDA [10], GVirtuS [11], or DS-CUDA [12], among others. Furthermore, when remote GPU virtualization solutions are considered at the cluster level, they provide noticeable reductions in the total execution time of a given workload composed of a set of computing jobs [13] and in the total energy required to execute such workloads [14].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

On the Execution of Computationally Intensive CPU-Based Libraries on Remote Accelerators for Increasing Performance: Early Experience with the OpenBLAS and FFTW Libraries

Valero

Silla

2015

2015 IEEE International Conference on Cluster Computing

View full text Add to dashboard Cite

Section: Performance When Communications Are Improvedmentioning

confidence: 99%

Section: Performance When Communications Are Improvedmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

On the Execution of Computationally Intensive CPU-Based Libraries on Remote Accelerators for Increasing Performance: Early Experience with the OpenBLAS and FFTW Libraries

Valero

Silla

2015

2015 IEEE International Conference on Cluster Computing

View full text Add to dashboard Cite

“…GPU virtualization solutions to GPGPU as GVirtuS have been implemented in research projects as rCUDA (Remote CUDA) [33] e DS-CUDA (DistributedShared CUDA) [17]. They all use an approach similar to GVirtuS, providing CUDA API wrappers on the front-end application in the guest OS while the back-end in the host OS accesses to the CUDA devices.…”

Section: Related Workmentioning

confidence: 99%

On the Virtualization of CUDA Based GPU Remoting on ARM and X86 Machines in the GVirtuS Framework

Montella

Giunta

Laccetti

et al. 2016

Int J Parallel Prog

View full text Add to dashboard Cite

The astonishing development of diverse and different hardware platforms is twofold: on one side, the challenge for the exascale performance for big data processing and management; on the other side, the mobile and embedded devices for data collection and human machine interaction. This drove to a highly hierarchical evolution of programming models. GVirtuS is the general virtualization system developed in 2009 and firstly introduced in 2010 enabling a completely transparent layer among GPUs and VMs. This paper shows the latest achievements and developments of GVirtuS, now supporting CUDA 6.5, memory management and scheduling. Thanks to the new and improved remoting capabilities, GVirtus now enables GPU sharing among physical and virtual machines based on x86 and ARM CPUs on local workstations, computing clusters and distributed cloud appliances.

show abstract

Mobile Edge Decoding for Saving Energy and Improving Experience

Zhao

You

et al. 2017

2017 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and I

View full text Add to dashboard Cite

POSTER: Boosting the performance of remote GPU virtualization using InfiniBand connect-IB and PCIe 3.0

Cited by 4 publications

References 8 publications

On the Execution of Computationally Intensive CPU-Based Libraries on Remote Accelerators for Increasing Performance: Early Experience with the OpenBLAS and FFTW Libraries

On the Execution of Computationally Intensive CPU-Based Libraries on Remote Accelerators for Increasing Performance: Early Experience with the OpenBLAS and FFTW Libraries

On the Virtualization of CUDA Based GPU Remoting on ARM and X86 Machines in the GVirtuS Framework

Mobile Edge Decoding for Saving Energy and Improving Experience

Contact Info

Product

Resources

About