Effect of dynamic algorithm selection of Alltoall communication on environments with unstable network speed

Nanri, Takeshi; Kurokawa, Motoyoshi

doi:10.1109/hpcsim.2011.5999894

Cited by 2 publications

(2 citation statements)

References 9 publications


“…As for the supercomputing system, after the system reaches a certain scale, the delay, bandwidth, and blocking of the communication between nodes are greatly affected by the network topology. In order to achieve the reasonable map between data distribution dimension and the system network topology, it is necessary to detect system data communication dynamic topology, through test sets and test system (including nodes between [10,11] designed a method called star-MPI (self-tuning adaptive routines for MPI collective operations), which can dynamically select the algorithm for ensemble communication in a network with unpredictable performance. This method tests various possible schemes and uses a certain prediction mechanism to delete the algorithm with low performance to save testing time.…”

Section: Introduction (mentioning)

confidence: 99%

Communication Optimization Technology Based on Network Dynamic Performance Model

Cui

Wang

2020

Mathematical Problems in Engineering


This work analyses different communication modes in applications of supercomputing, proposes a communication dynamic performance model based on topology awareness, and realizes the prototype system of all-to-all communication and stencil communication optimization based on this model. Basic tests on the optimization of all-to-all communication and stencil communication were carried out on the Sunway TaihuLight System, and this achieved obvious optimization results. Several applications, including molecular dynamics simulation and turbulence simulation, have been optimized and tested. The average performance has been improved obviously. It can be expected that, for other large-scale applications, this optimization method can also be used to obtain significant improvement in communication performance.


“…Other studies provide performance analysis of point to point or collective communication on different interconnects (Ismail et al, 2011; Rashti and Afsahi, 2007) while some provide comparison and analysis of multiple algorithms for collective communication in order to find the best solution for different parallel systems (Nanri and Kurokawa, 2011; Hamid and Coddington, 2007). Other related studies focused on optimizing the performance of MPI collective communication by proposing topology aware mechanisms (Gong et al, 2013; Subramoni et al, 2011; Kandalla et al, 2010) and process arrival patterns aware mechanisms (Qian and Afsahi, 2009; Patarasuk and Yuan, 2008) to achieve the best performance in terms of time.…”

Section: Related Work (mentioning)

confidence: 99%

Performance Analysis of Message Passing Interface Collective Communication on Intel Xeon Quad-Core Gigabit Ethernet and Infiniband Clusters

Ismail

Hamid

Othman

et al. 2013

Journal of Computer Science


The performance of MPI implementation operations still presents critical issues for high performance computing systems, particularly for more advanced processor technology. Consequently, this study concentrates on benchmarking MPI implementation on multi-core architecture by measuring the performance of Open MPI collective communication on Intel Xeon dual quad-core Gigabit Ethernet and InfiniBand clusters using SKaMPI. It focuses on well known collective communication routines such as MPI-Bcast, MPI-AlltoAll, MPI-Scatter and MPI-Gather. From the collection of results, MPI collective communication on InfiniBand clusters had distinctly better performance in terms of latency and throughput. The analysis indicates that the algorithm used for collective communication performed very well for all message sizes except for MPI-Bcast and MPI-Alltoall operation of inter-node communication. However, InfiniBand provides the lowest latency for all operations since it provides applications with an easy to use messaging service, compared to Gigabit Ethernet, which still requests the operating system for access to one of the server communication resources with the complex dance between an application and a network.
