2008
DOI: 10.1109/ipdps.2008.4536141
|View full text |Cite
|
Sign up to set email alerts
|

Scaling alltoall collective on multi-core systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2008
2008
2020
2020

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(16 citation statements)
references
References 7 publications
0
16
0
Order By: Relevance
“…Our approach cannot however be applied to some hierarchical algorithms that use multiple leaders per node for inter-node communication [10,8]. This approach has the advantage of distributing the leaders load across multiple cores.…”
Section: Discussion and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Our approach cannot however be applied to some hierarchical algorithms that use multiple leaders per node for inter-node communication [10,8]. This approach has the advantage of distributing the leaders load across multiple cores.…”
Section: Discussion and Related Workmentioning
confidence: 99%
“…Intranode optimizations are now often combined with inter-node communication within hierarchical algorithms. For an Alltoall operation, this idea may be implemented through an intra-node Alltoall on each node, followed by inter-node Alltoall between all groups of corresponding local ranks [10].…”
Section: Collective Operations On Many-core Clustersmentioning
confidence: 99%
“…MPI [22,17] and parallel programming languages such as UPC [19] provide optimized implementations of all-to-all collective operations. Most if not all of the existing implementations use multiple algorithms selected by message size.…”
Section: All-to-all Performancementioning
confidence: 99%
“…Much work has been done to improve MPI performance on SMP-CMP clusters. Techniques were developed to improve both point-to-point communications [7], [9], [10], [12] and collective communications [14], [15], [17], [19]. These optimizations further differentiate the communication performance in the multi-layer communication infrastructure in SMP-CMP clusters and manifest the impacts of processor affinity.…”
Section: Related Workmentioning
confidence: 99%