2020 International Conference on Field-Programmable Technology (ICFPT) 2020
DOI: 10.1109/icfpt51103.2020.00030

A Reconfigurable Compute-in-the-Network FPGA Assistant for High-Level Collective Support with Distributed Matrix Multiply Case Study

citations
Cited by 6 publications
(2 citation statements)
references
References 18 publications
“…MPI offers various primitives; among them collectives are integral part of MPI and they are frequently invoked in a spectrum of HPC applications [2]. Offloading MPI collectives to network devices (NICs and switches) is gaining much interest as an effective mechanism to improve the application performance [3]-[10]. More specifically, in-network processing unlocks higher application performance by reducing inter- and intra-node communication and bypassing MPI software layers.…”
Section: Introduction Message Passing Interface (MPI) | mentioning
confidence: 99%
“…More specifically, in-network processing unlocks higher application performance by reducing inter- and intra-node communication and bypassing MPI software layers. As new classes of devices including programmable NICs/switches [11], [12], Data Processing Units (DPUs) [13], and accelerators (FPGAs, GPUs) [14]-[16] are emerging in the datacenters [17], [18], we posit that there is an unrevealed opportunity to further improve the performance by extending in-network collective processing to a new class of complex collectives.…”
Section: Introduction Message Passing Interface (MPI) | mentioning
confidence: 99%