2012 IEEE 10th International Symposium on Parallel and Distributed Processing With Applications 2012
DOI: 10.1109/ispa.2012.44

An Efficient All-to-all Communication Algorithm for Mesh/Torus Networks

Abstract: An efficient all-to-all communication algorithm for torus and mesh networks, A2AT, was proposed. A2AT schedules the message sending sequence so that all links are fully used, exploiting each node's ability to transfer messages concurrently. With A2AT, the hop count of messages equals the maximum number of messages sharing a link in their routes for all message transfers. A2AT can therefore maintain synchronization without a phasing operation such as an MPI barrier. When the VOQ which is an ideal con…

Cited by 6 publications (4 citation statements)
References 12 publications
“…In past years, many results have been reported for meshes and tori, that have a simple and regular topology, a bounded node degree and present good scalability, see, e.g., [28], [29], [30], and [31] for meshes and [32], [33], and [34] for tori, and the more recent [35]. In particular, in [28] Scott uses the direct approach, consisting in scheduling direct communication between every pair of processors and shows that, in an a × a mesh, the number of contention-free communication steps must be at least a³/4.…”
Section: Related Work (mentioning)
confidence: 99%
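
One common way to arrive at a bound of this form is a bisection count; the sketch below is an illustration under that assumption, not the derivation used in [28]. It assumes an even side length a, one message per link per direction per step, and a vertical cut through the middle of the mesh. The function name `bisection_lower_bound` is hypothetical.

```python
def bisection_lower_bound(a: int) -> int:
    """Lower bound on contention-free steps for all-to-all personalized
    communication on an a x a mesh (a even), from a bisection count."""
    if a <= 0 or a % 2:
        raise ValueError("a must be a positive even integer")

    nodes = [(x, y) for x in range(a) for y in range(a)]
    left = [n for n in nodes if n[0] < a // 2]
    right = [n for n in nodes if n[0] >= a // 2]

    # Every message from a left-half node to a right-half node must
    # cross the cut between columns a/2 - 1 and a/2.
    crossing = len(left) * len(right)

    # Assumption: the cut consists of a links, each carrying at most
    # one left-to-right message per step.
    links = a

    # Ceiling division: steps needed to push all crossing messages through.
    return -(-crossing // links)


if __name__ == "__main__":
    for side in (2, 4, 6, 8):
        print(side, bisection_lower_bound(side), side**3 // 4)
```

For even a the count is (a²/2)·(a²/2)/a, which matches the a³/4 figure quoted above.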
“…Hence, both direct and indirect approaches reach the lower bound, which for meshes consisting of N processors is O(N√N) time. In [35], an efficient all-to-all communication algorithm implemented on a mesh/torus network is presented. The algorithm schedules sending messages in sequence so that all links are fully used and collisions are avoided, thus reducing the communication time with respect to conventional algorithms for all-to-all personalized communication on mesh and torus.…”
Section: Related Work (mentioning)
confidence: 99%
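
As a worked restatement (not part of the cited text), the O(N√N) figure agrees with a step bound of a³/4 on an a × a mesh once N = a² is substituted:

```latex
N = a^2 \;\Rightarrow\; \frac{a^3}{4} = \frac{N^{3/2}}{4} = \frac{N\sqrt{N}}{4} = O\!\left(N\sqrt{N}\right)
```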
“…Conversely, the links near the destination nodes are drawn in green, which implies higher link utilization and no blocking. Fig. 3 shows snapshots of all-to-all communication using a conventional algorithm (upper) and the new algorithm [2] (lower) in a 25 × 25 torus network. We see many blue and green links with the new algorithm, whereas there are some pink links with the conventional algorithm.…”
Section: Communication Algorithm Development Environment (mentioning)
confidence: 99%
“…Many complicated scientific parallel computing applications exhibit the necessity for all-to-all communication, where every processor has to send a message to all other processors in the network ([1], [3], [4], [9], [10], [14], [24], [25], [26], [27]). All-to-all communication is divided into two communication patterns: index (or all-to-all personalized communication) and concatenation (or all-to-all broadcast).…”
Section: Introduction (mentioning)
confidence: 99%
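
The difference between the two patterns can be seen in a minimal sketch that models only the data movement, with no network or scheduling. The function names are hypothetical, and nodes are represented simply as list indices.

```python
def all_to_all_personalized(send):
    """Index pattern: send[i][j] is the distinct message node i holds
    for node j. Afterwards node j holds send[i][j] from every i,
    which is the transpose of the send matrix."""
    n = len(send)
    return [[send[i][j] for i in range(n)] for j in range(n)]


def all_to_all_broadcast(items):
    """Concatenation pattern: node i contributes the single item
    items[i], and every node ends up with the full list."""
    return [list(items) for _ in items]


if __name__ == "__main__":
    send = [["a0", "a1"], ["b0", "b1"]]
    print(all_to_all_personalized(send))   # each node receives one message per sender
    print(all_to_all_broadcast(["x", "y"]))  # each node receives every item
```

In the personalized case each sender prepares a different message per destination, so the total volume grows with the square of the node count, whereas in the broadcast case each node contributes only one item that is copied to everyone.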