2011 International Conference on High Performance Computing &Amp; Simulation 2011
DOI: 10.1109/hpcsim.2011.5999894
|View full text |Cite
|
Sign up to set email alerts
|

Effect of dynamic algorithm selection of Alltoall communication on environments with unstable network speed

Abstract: As the HPC systems increase their size, performance of collective communications is becoming an important issue. Usually, decisions for which algorithm of those communications to be used are done based on statically specified thresholds of the size of messages and the number of processes. However, on recent HPC systems that are hiring Fat Tree or Torus topology as their interconnect, the network speed has become unpredictable. The main reason is the effect of contentions. This effect depends heavily on the rel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…As for the supercomputing system, after the system reaches a certain scale, the delay, bandwidth, and blocking of the communication between nodes are greatly affected by the network topology. In order to achieve the reasonable map between data distribution dimension and the system network topology, it is necessary to detect system data communication dynamic topology, through test sets and test system (including nodes between [10,11] designed a method called star-MPI (self-tuning adaptive routines for MPI collective operations), which can dynamically select the algorithm for ensemble communication in a network with unpredictable performance. is method tests various possible schemes and uses a certain prediction mechanism to delete the algorithm with low performance to save testing time.…”
Section: Introductionmentioning
confidence: 99%
“…As for the supercomputing system, after the system reaches a certain scale, the delay, bandwidth, and blocking of the communication between nodes are greatly affected by the network topology. In order to achieve the reasonable map between data distribution dimension and the system network topology, it is necessary to detect system data communication dynamic topology, through test sets and test system (including nodes between [10,11] designed a method called star-MPI (self-tuning adaptive routines for MPI collective operations), which can dynamically select the algorithm for ensemble communication in a network with unpredictable performance. is method tests various possible schemes and uses a certain prediction mechanism to delete the algorithm with low performance to save testing time.…”
Section: Introductionmentioning
confidence: 99%
“…Other studies provide performance analysis of point to point or collective communication on different interconnects (Ismail et al, 2011;Rashti and Afsahi, 2007) while some provide comparison and analysis of multiple algorithms for collective communication in order to find the best solution for different parallel systems (Nanri and Kurokawa, 2011;Hamid and Coddington, 2007). Other related studies focused on optimizing the performance of MPI collective communication by proposing topology aware mechanisms (Gong et al, 2013;Subramoni et al, 2011;Kandalla et al, 2010) and process arrival patterns aware mechanisms (Qian and Afsahi, 2009;Patarasuk and Yuan, 2008) to achieve the best performance in terms of time.…”
Section: Related Workmentioning
confidence: 99%