Influence of InfiniBand FDR on the performance of remote GPU virtualization

Reaño, Carlos; Mayo, Rafael; Quintana‐Ortí, Enrique S.; Silla, Federico; Duato, J.; Peña, Antonio J.

doi:10.1109/cluster.2013.6702662

Cited by 20 publications

(23 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…So the predominant communication mode between a compute node and composable GPU and FPGA resources is likely through bulk data transfer. It has been shown by [37] that adequate bandwidth such as those offered by RDMA at FDR data rate (56 Gb/s) already demonstrated superior performance than a locally connected GPU.…”

Section: Software Stackmentioning

confidence: 99%

Composable architecture for rack scale big data computing

Franke

Parris

et al. 2017

Future Generation Computer Systems

View full text Add to dashboard Cite

Keywords:Big data platforms, Composable system architecture, Disaggregated datacenter architecture, composable datacenter, software defined environments, software defined networking. Abstract:The rapid growth of cloud computing, both in terms of the spectrum and volume of cloud workloads, necessitate re-visiting the traditional rack-mountable servers based datacenter design. Next generation datacenters need to offer enhanced support for: (i) fast changing system configuration requirements due to workload constraints, (ii) timely adoption of emerging hardware technologies, and (iii) maximal sharing of systems and subsystems in order to lower costs. Disaggregated datacenters, constructed as a collection of individual resources such as CPU, memory, disks etc., and composed into workload execution units on demand, are an interesting new trend that can address the above challenges. In this paper, we demonstrated the feasibility of composable systems through building a rack scale composable system prototype using PCIe switch. Through empirical approaches, we develop assessment of the opportunities and challenges for leveraging the composable architecture for rack scale cloud datacenters with a focus on big data and NoSQL workloads. In particular, we compare and contrast the programming models that can be used to access the composable resources, and developed the implications for the network and resource provisioning and management for rack scale architecture.

show abstract

Section: Software Stackmentioning

confidence: 99%

Composable architecture for rack scale big data computing

Franke

Parris

et al. 2017

Future Generation Computer Systems

View full text Add to dashboard Cite

show abstract

“…Although remote GPU virtualization has demonstrated very low overhead with respect to a configuration with a local GPU [22], due to its novelty, this technology is not yet supported by the job schedulers that are commonly encountered in production clusters (e.g., SLURM [23], PBSPro [24], MOAB [25], TORQUE [26], LSF [27], OAR [28], MAUI [29], LoadLever [30], Condor [31], and Sun Grid Engine [32]). In particular, a common job scheduler in production today only deals with real GPUs so that, when a job requests a number of nodes equipped with one (or more) GPU(s), the scheduler will try to map that job to nodes that actually own the requested number of GPUs, thus impairing the benefits of GPU virtualization.…”

Section: Introductionmentioning

confidence: 99%

SLURM Support for Remote GPU Virtualization: Implementation and Performance Study

Iserte

Castelló

Mayo

et al. 2014

2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing

View full text Add to dashboard Cite

“…Remote GPU virtualization techniques can help increase GPU utilization rates, while reducing acquisition and maintenance costs. For these reasons, many different virtualization solutions are available today, such as rCUDA [4], [5], SnuCL [6], GVirtuS [7], DS-CUDA [8], and VOCL [9].…”

Section: Introductionmentioning

confidence: 99%

POSTER: Boosting the performance of remote GPU virtualization using InfiniBand connect-IB and PCIe 3.0

Reaño

Silla

Peña

et al. 2014

2014 IEEE International Conference on Cluster Computing (CLUSTER)

Self Cite

View full text Add to dashboard Cite

Abstract-A clear trend has emerged involving the acceleration of scientific applications by using GPUs. However, the capabilities of these devices are still generally underutilized. Remote GPU virtualization techniques can help increase GPU utilization rates, while reducing acquisition and maintenance costs. The overhead of using a remote GPU instead of a local one is introduced mainly by the difference in performance between the internode network and the intranode PCIe link. In this paper we show how using the new InfiniBand Connect-IB network adapters (attaining similar throughput to that of the most recently emerged GPUs) boosts the performance of remote GPU virtualization, reducing the overhead to a mere 0.19% in the application tested.

show abstract

Influence of InfiniBand FDR on the performance of remote GPU virtualization

Cited by 20 publications

References 17 publications

Composable architecture for rack scale big data computing

Composable architecture for rack scale big data computing

SLURM Support for Remote GPU Virtualization: Implementation and Performance Study

POSTER: Boosting the performance of remote GPU virtualization using InfiniBand connect-IB and PCIe 3.0

Contact Info

Product

Resources

About