2015 IEEE 23rd Annual Symposium on High-Performance Interconnects 2015
DOI: 10.1109/hoti.2015.21
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting Offload Enabled Network Interfaces

Abstract: Network interface cards are one of the key components to achieve efficient parallel performance. In the past, they have gained new functionalities such as lossless transmission and remote direct memory access that are now ubiquitous in high-performance systems. Prototypes of next generation network cards now offer new features that facilitate device programming. In this work, various possible uses of network offload features are explored. We use the Portals 4 interface specification as an example to demonstrat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(12 citation statements)
references
References 25 publications
0
12
0
Order By: Relevance
“…Offloading the schedule execution to the network interface card (NIC) can provide different advantages such as asynchronous execution, lower latency, and streaming processing. Di Girolamo et al [16] show how solo collectives can be offloaded to Portals 4 [7] NICs by using triggered operations. This approach is limited by the amount of NIC resources that bounds the number of times a persistent schedule can be executed without application intervention.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Offloading the schedule execution to the network interface card (NIC) can provide different advantages such as asynchronous execution, lower latency, and streaming processing. Di Girolamo et al [16] show how solo collectives can be offloaded to Portals 4 [7] NICs by using triggered operations. This approach is limited by the amount of NIC resources that bounds the number of times a persistent schedule can be executed without application intervention.…”
Section: Discussionmentioning
confidence: 99%
“…A solo collective [16] is a wait-free operation, which forces the slow processes to execute the collective as soon as there is one process executing it. This process, called initiator, is in charge of informing the others to join the collective.…”
Section: Solo Collectivesmentioning
confidence: 99%
“…However, this network interface emulates limited processing capabilities. 60 A general solution was provided by Voltaire 61 which included processing support in the router for collectives; this work differs from ours in that the offload is to an in-router CPU rather than a hardware augmentation of the switch.…”
Section: Related Workmentioning
confidence: 99%
“…In this scenario, even a single delayed process a ects the job's training time. In contrast to the synchronous mode, in MPI, there is a wait-free operation, which is called partial collective communication [16]. It forces the slow processes to execute the collective communication as soon as there is one process executing it.…”
Section: Randomized Partial Collectivesmentioning
confidence: 99%