2010 IEEE International Symposium on Parallel &Amp; Distributed Processing, Workshops and PHD Forum (IPDPSW) 2010
DOI: 10.1109/ipdpsw.2010.5470853
|View full text |Cite
|
Sign up to set email alerts
|

Designing topology-aware collective communication algorithms for large scale InfiniBand clusters: Case studies with Scatter and Gather

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
42
0

Year Published

2011
2011
2022
2022

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 60 publications
(42 citation statements)
references
References 12 publications
0
42
0
Order By: Relevance
“…Most of the previous work [2,12,27,25,26,18,13,16] addresses congestion in the core (switches) of HPC networks. As our experimental evaluation shows, the advent of multicore processors introduces congestion at the edge of these networks and mechanisms to handle Concurrency Congestion are required for best performance on contemporary hardware.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Most of the previous work [2,12,27,25,26,18,13,16] addresses congestion in the core (switches) of HPC networks. As our experimental evaluation shows, the advent of multicore processors introduces congestion at the edge of these networks and mechanisms to handle Concurrency Congestion are required for best performance on contemporary hardware.…”
Section: Discussionmentioning
confidence: 99%
“…Dvorak et al [13] described techniques for topology aware scheduling of many-to-many collective operations. Kandalla et al [16] discussed topology aware scatter and gather for large scale InfiniBand clusters. Thakur et al [22] discussed the scalability of MPI collectives and described implementations that use multiple algorithms in order to alleviate congestion in data intensive operations such as all-to-all.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Other related studies focused on optimizing the performance of MPI collective communication by proposing topology aware mechanisms (Gong et al, 2013;Subramoni et al, 2011;Kandalla et al, 2010) and process arrival patterns aware mechanisms (Qian and Afsahi, 2009;Patarasuk and Yuan, 2008) to achieve the best performance in terms of time.…”
Section: Related Workmentioning
confidence: 99%
“…Recent works [3], [4], [5], [6], [8], [9], [10], [11] have shown substantial communication performance improvement on large parallel machines by suitable assignment of processes or tasks to nodes of the machine. Earlier works on graph embedding are usually not suitable for modern machines because the earlier works used metrics suitable for a store-and-forward communication mechanism.…”
Section: Introductionmentioning
confidence: 99%