Proceedings of the Sixth Annual ACM Symposium on Parallel Algorithms and Architectures - SPAA '94 1994
DOI: 10.1145/181014.181427
|View full text |Cite
|
Sign up to set email alerts
|

An architecture for optimal all-to-all personalized communication

Abstract: In all-to-all personalized communication (AAPC), every node of a parallel system sends a potentially unique packet to every other node. AAPC is an important primitive operation for modern parallel compilers, since it is used to redistribute data structures during parallel computations. As an extremely dense communication pattern, AAPC causes congestion in many types of networks and therefore executes very poorly on general purpose, asynchronous message passing routers.We present and evaluate a network architec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

1995
1995
2007
2007

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 53 publications
(21 citation statements)
references
References 21 publications
0
21
0
Order By: Relevance
“…Similar to other AAPC scheduling schemes [6], our scheme partitions AAPC into contention-free phases and fully utilizes the bandwidth in the bottleneck links in all phases. Hence, realizing AAPC with the contention-free phases can potentially achieve the minimum communication completion time.…”
Section: Switched Ethernet Is the Most Widely Used Local-area-networkmentioning
confidence: 99%
See 1 more Smart Citation
“…Similar to other AAPC scheduling schemes [6], our scheme partitions AAPC into contention-free phases and fully utilizes the bandwidth in the bottleneck links in all phases. Hence, realizing AAPC with the contention-free phases can potentially achieve the minimum communication completion time.…”
Section: Switched Ethernet Is the Most Widely Used Local-area-networkmentioning
confidence: 99%
“…A large number of optimal message scheduling algorithms for different network topologies with different network models were developed. Many of the algorithms were designed for specific network topologies that are used in parallel machines, including hypercube [7,24], mesh [1,18,17,22], torus [6,11], k-ary n-cube [24], and fat tree [3,16]. Heuristic algorithms were developed for AAPC on irregular topologies [5,14].…”
Section: Related Workmentioning
confidence: 99%
“…connections on ring and torus topologies can be found in [3]. Using an algorithm in [3], a multiplexing degree of…”
Section: Logical Topologies and Their Performance Under Light Loadmentioning
confidence: 99%
“…Many previous works have studied the total-exchange (also called complete exchange and all-to-all personalized communication) primitive in which each processor has a distinct message to send to every other processor (see, e.g., [9], [14], [18], [31], [33], [34], [36], [41], [46] and [47]). The total-exchange primitive has been incorporated into communication libraries such as the Collective Communication Library provided with the IBM SP-2 [8].…”
Section: Modeling Parallel Bandwidthmentioning
confidence: 99%