2016 IEEE 24th Annual Symposium on High-Performance Interconnects (HOTI) 2016
DOI: 10.1109/hoti.2016.024
|View full text |Cite
|
Sign up to set email alerts
|

Offloading Collective Operations to Programmable Logic on a Zynq Cluster

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
5
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
2
2
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 14 publications
0
5
0
Order By: Relevance
“…This approach rst calls the barrier collective communication operation to synchronize all nodes, and then the reduce-all collective communication operation to complete the GALT operation [13]. Field programmable gate arrays (FPGAs) have been employed to enhance the operation of collective communication [14], [15].…”
Section: Related Workmentioning
confidence: 99%
“…This approach rst calls the barrier collective communication operation to synchronize all nodes, and then the reduce-all collective communication operation to complete the GALT operation [13]. Field programmable gate arrays (FPGAs) have been employed to enhance the operation of collective communication [14], [15].…”
Section: Related Workmentioning
confidence: 99%
“…This approach rst calls the barrier collective communication operation to synchronize all nodes, and then the reduce-all collective communication operation to complete the GALT operation [13]. Field programmable gate arrays (FPGAs) have been employed to enhance the operation of collective communication [14], [15].…”
Section: Related Workmentioning
confidence: 99%
“…Previous work has shown that significant performance speedups can be achieved by offloading collectives onto hardware. These generally enhance the NIC, [15][16][17][18] tightly connected with the processor via interconnects such as PCI, whereas the work reported here adds hardware support in the switch. For instance, Arap et al 15 offload collectives onto an FPGA cluster; however, they do not mention any communicator support, nor do they integrate into a switch.…”
Section: Related Workmentioning
confidence: 99%
“…These generally enhance the NIC, [15][16][17][18] tightly connected with the processor via interconnects such as PCI, whereas the work reported here adds hardware support in the switch. For instance, Arap et al 15 offload collectives onto an FPGA cluster; however, they do not mention any communicator support, nor do they integrate into a switch. Schmidt et al 16 implement MPI_Reduce in an FPGA cluster for the AIREN network.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation