Proceedings of the 26th ACM International Conference on Supercomputing 2012
DOI: 10.1145/2304576.2304594
|View full text |Cite
|
Sign up to set email alerts
|

Congestion avoidance on manycore high performance computing systems

Abstract: Efficient communication is a requirement for application scalability on High Performance Computing systems. In this paper we argue for incorporating proactive congestion avoidance mechanisms into the design of communication layers on manycore systems. This is in contrast with the status quo which employs a reactive approach, e.g. congestion control mechanisms are activated only when resources have been exhausted. We present a core stateless optimization approach based on open loop end-point throttling, impleme… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 21 publications
(11 citation statements)
references
References 28 publications
0
11
0
Order By: Relevance
“…For large messages, parallelization does not improve performance when compared to the original setting, yet there is a noticeable difference from peak bandwidth. In this case orthogonal concurrency throttling techniques as described by Luo [17] are required. Note that due to decoupling the communication into a standalone subsystem, these techniques are easy to implement in our architecture.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…For large messages, parallelization does not improve performance when compared to the original setting, yet there is a noticeable difference from peak bandwidth. In this case orthogonal concurrency throttling techniques as described by Luo [17] are required. Note that due to decoupling the communication into a standalone subsystem, these techniques are easy to implement in our architecture.…”
Section: Discussionmentioning
confidence: 99%
“…For well balanced applications, there usually exists communication concurrency, even enough to cause congestion [17]. In this case throttling the spatial concurrency of communication improves performance.…”
Section: Communication and Concurrencymentioning
confidence: 99%
“…With strong scaling, messages become large at low concurrency. As indicated by Luo et al [27] on the networks we use (Aries and In-finiBand) issuing simultaneously a large number of large messages degrades performance.…”
Section: Casementioning
confidence: 93%
“…While there is little related work on the endpoints as introduced in [10], a large body of work exists on the exploitation of shared memory nodes within MPI or other parallel programming languages like UPC [7], [9], [13]. Many of these papers focus on optimizing various communication primitives by means of a shared memory region.…”
Section: Related Workmentioning
confidence: 99%
“…The main program (Line 3) creates the PAMI client and a number of contexts. Our simple example creates a thread to handle each context (lines [11][12][13]. Each thread receives the context ID as an argument (not shown for brevity).…”
Section: A Interfacementioning
confidence: 99%