Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems 2022
DOI: 10.1145/3503222.3507776
|View full text |Cite
|
Sign up to set email alerts
|

FlexDriver: a network driver for your accelerator

Abstract: We propose a new system design for connecting hardware and FPGA accelerators to the network, allowing the accelerator to directly control commodity Network Interface Cards (NICs) without using the CPU. This enables us to solve the key challenge of leveraging existing NIC hardware offloads such as virtualization, tunneling, and RDMA for accelerator networking. Our approach supports a diverse set of use cases, from direct network access for disaggregated accelerators to inline-acceleration of the network stack, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 51 publications
0
7
0
Order By: Relevance
“…FlexDriver [25] is a hardware module for FPGAs, developed by NVIDIA. It provides a data-plane driver designed to communicate with a dedicated NIC using peer-to-peer PCIe, giving RDMA capabilities to accelerators without a full FPGA NIC offload.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…FlexDriver [25] is a hardware module for FPGAs, developed by NVIDIA. It provides a data-plane driver designed to communicate with a dedicated NIC using peer-to-peer PCIe, giving RDMA capabilities to accelerators without a full FPGA NIC offload.…”
Section: Related Workmentioning
confidence: 99%
“…This is attributed to the cost of traversing the PCIe interface (around 450ns per-hop [38]). This shows the benefits of our architecture in allowing direct control of the networking components from the FPGA fabric itself, as opposed to an architecture such as FlexDriver [25] in which NIC and FPGA fabric are separated by a PCIe hop, particularly for pointer-chasing workloads.…”
Section: Google Multi-chase Benchmarkmentioning
confidence: 99%
“…ORCA can also be included to these co-design frameworks as another compute resource. Lynx [158] proposes Smart NIC-based communication offloading for accelerator-rich systems, and FlexDriver [39] proposes PCIe-based NIC control by accelerator. ORCA takes one step further to let the client directly communicate with the accelerators, which also controls the NIC more efficiently in the coherence domain.…”
Section: Related Workmentioning
confidence: 99%
“…Emerging SmartNIC frameworks propose a split-KVS design, where a small amount of NIC memory stores the KVS' hottest items [26,81], embodying the "small cache, big effect" principle [28]. Like NetCache [44], such designs focus on reads and are inefficient for write-intensive workloads, because writes are handled on a slow path to keep the CPU's main memory up-to-date.…”
Section: Sensitivity Analysismentioning
confidence: 99%
“…System Software for RPCs. Although we implement C-4 over the NeBuLa architecture, our design could be implemented over a variety of baselines (e.g., NanoPU [39], FlexNIC [52], or NICA [26]). Due to the unique challenges of µs-scale RPCs that characterize KVS, several proposals pursue synchronization-free RPC load balancing [17,38,45,54,62,77,83,93].…”
Section: Concurrency and Synchronizationmentioning
confidence: 99%