Proceedings of SYSTOR 2009: The Israeli Experimental Systems Conference 2009
DOI: 10.1145/1534530.1534552
|View full text |Cite
|
Sign up to set email alerts
|

Improving communication-phase completion times in HPC clusters through congestion mitigation

Abstract: Congestion arises in cluster-based supercomputers due to contention for links, spreads due to oversubscription of communication resources, and reduces performance. We mitigate it using efficient, scalable adaptive routing and explicit rate calculation. We use virtual circuits for in-order packet delivery; path setup is performed by switches locally with no blocking or backtracking. For random permutations in a slightly enriched fat-tree topology, maximum contention is reduced by up to 50% relative to static ro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2010
2010
2017
2017

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
references
References 34 publications
0
0
0
Order By: Relevance