2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing 2010
DOI: 10.1109/ccgrid.2010.9

ConnectX-2 InfiniBand Management Queues: First Investigation of the New Support for Network Offloaded Collective Operations

Cited by 35 publications
(17 citation statements)
References 9 publications
“…In this figure and all subsequent figures MQ is used to label results obtained using HCA offloading approaches, and PTP is used to label results obtained with point-to-point based collective algorithms progressed by the CPU in main memory. While results for the barrier algorithm have been presented previously [19], the current results are significantly improved at the larger process counts. This is a result of the reduced number of queue pairs used in the algorithm, decreasing the number of network context resources the HCA needs to manage.…”
Section: B Discussion (mentioning)
confidence: 64%
“…A detailed description of the CORE-Direct support and how this is used to implement support for MPI collective operations in Open MPI [18] is described elsewhere [19]. In this section we provide a brief description of these, as well as very recent enhancements to the MPI support.…”
Section: An Overview Of InfiniBand (mentioning)
confidence: 99%
“…The newest generation of InfiniBand [24] adapters from Mellanox provide the Collective Offload Resource Engine, CORE-Direct [25]. This adds a management queue to the standard InfiniBand queue pair.…”
Section: Experimental Evaluation (mentioning)
confidence: 99%
“…Network Offload Architectures Some newer network architectures such as Portals IV [7] or CORE-Direct [14] allow to offload collective operations to the network device. This enables faster execution (messages do not need to travel to the CPU) and isolation (computations on the CPU and collective communications do not interfere and can progress independently).…”
Section: T Hoefler D Moor (mentioning)
confidence: 99%
“…To offload a collective operation to a network device, one copies some state (e.g., a set of triggers [7] or a set of management queue entries [14]) that models the execution schedule to the device. The device then generates messages based on arriving messages from other processes and the local state without CPU involvement.…”
Section: T Hoefler D Moor (mentioning)
confidence: 99%
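
The offload idea in the last statement (copy a schedule to the device, then let arriving messages drive further sends without the CPU) can be illustrated with a toy simulation. This is only a sketch under stated assumptions: it does not use the CORE-Direct or Portals interfaces, the names `build_schedule`, `Device`, and `simulate_barrier` are hypothetical, and the dissemination barrier is chosen here for illustration rather than taken from the cited papers.

```python
from collections import deque


def build_schedule(rank, nprocs):
    """Host side (hypothetical): describe a dissemination barrier as a list of
    (wait_from, send_to) entries, one per round. This list is the 'state'
    that would be copied to the device once, up front."""
    schedule = []
    dist = 1
    while dist < nprocs:
        schedule.append(((rank - dist) % nprocs, (rank + dist) % nprocs))
        dist *= 2
    return schedule


class Device:
    """Toy model of a network device that progresses a schedule on its own."""

    def __init__(self, rank, schedule):
        self.rank = rank
        self.schedule = schedule
        self.step = 0            # round currently waiting for its incoming message
        self.arrived = set()     # (round, source) pairs already received
        self.done = not schedule

    def start(self, network):
        # The only host-initiated action: post the send of round 0.
        if self.schedule:
            network.append((0, self.rank, self.schedule[0][1]))

    def on_message(self, rnd, src, network):
        # An arriving message may trigger the send of the next round.
        self.arrived.add((rnd, src))
        while self.step < len(self.schedule):
            wait_from, _ = self.schedule[self.step]
            if (self.step, wait_from) not in self.arrived:
                return           # still waiting; no host involvement needed
            self.step += 1
            if self.step < len(self.schedule):
                network.append((self.step, self.rank, self.schedule[self.step][1]))
        self.done = True


def simulate_barrier(nprocs):
    """Run all devices to completion; return (all_done, messages_delivered)."""
    devices = [Device(r, build_schedule(r, nprocs)) for r in range(nprocs)]
    network = deque()            # in-flight messages: (round, src, dst)
    for dev in devices:
        dev.start(network)
    delivered = 0
    while network:
        rnd, src, dst = network.popleft()
        delivered += 1
        devices[dst].on_message(rnd, src, network)
    return all(dev.done for dev in devices), delivered


if __name__ == "__main__":
    print(simulate_barrier(8))
```

After `start`, every further send in this model is fired from inside `on_message`, which mirrors the statement that the device generates messages from arriving messages and local state alone.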