Proceedings of IEEE Scalable High Performance Computing Conference
DOI: 10.1109/shpcc.1994.296665

Interprocessor collective communication library (InterCom)

Abstract: In this paper, we outline a unified approach for building a library of collective communication operations that performs well on a cross-section of problems encountered in real applications. The target architecture is a two-dimensional mesh with worm-hole routing, but the techniques also apply to higher dimensional meshes and hypercubes. We stress a general approach, addressing the need for implementations that perform well for various sized vectors and grid dimensions, including non-power-of-two grids. This r…

Cited by 102 publications (59 citation statements)
References 3 publications
“…Sanders et al [10], Sistare et al [11] and Tipparaju et al [12] discuss various approaches to optimize collective communication algorithms for SMP clusters. Some work focuses on using different algorithms for different message size, such as [13,14]. None of previous work shows how it interacts with existing process placement approaches which are based on point-to-point communications and may also result in sub-optimal mappings.…”
Section: Related Work (mentioning)
confidence: 99%
“…For example, op will be an addition operator for a count aggregation and a maximum operator for a max aggregation. Such collective communication for computing a final global meta array can be implemented efficiently on most parallel computers and networks of workstations [1]. Thus, the overhead for combining global meta arrays is expected to be negligible because the volume of communication is only P words per processor.…”
Section: Parallel Bucket Algorithm (mentioning)
confidence: 99%
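The global combine described in the statement above can be illustrated with a minimal sketch. It is not taken from the citing paper or from InterCom: the function name `global_combine`, the sample data, and the sequential in-process simulation of processors are all assumptions made for illustration. It only shows how swapping `op` between addition and maximum changes the aggregation while every processor ends up with the same combined array.

```python
from functools import reduce
import operator


def global_combine(local_arrays, op):
    """Hypothetical helper: combine one equal-length array per simulated
    processor, element by element, using the binary operator `op`."""
    if not local_arrays:
        raise ValueError("need at least one processor")
    length = len(local_arrays[0])
    if any(len(a) != length for a in local_arrays):
        raise ValueError("all local arrays must have the same length")
    # zip(*...) groups the i-th word of every processor into one column,
    # and reduce folds that column with op.
    combined = [reduce(op, column) for column in zip(*local_arrays)]
    # Every simulated processor receives its own copy of the result.
    return [list(combined) for _ in local_arrays]


# Assumed sample data: P = 3 processors, each holding P words.
local_meta = [[3, 0, 1], [1, 4, 0], [2, 2, 5]]

count_result = global_combine(local_meta, operator.add)  # count aggregation
max_result = global_combine(local_meta, max)             # max aggregation
```

A real implementation would exchange messages between processors rather than loop in one process; the sketch only fixes the semantics of the combine step.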
“…To counter the effects of congestion, carefully designed topology aware algorithms have been used for collectives on such networks [4,5]. In addition, there is a set of topology oblivious algorithms which perform reasonably well on most systems [7,8,3]. However, most of these algorithms do not seem to be a good fit for two-tier direct networks as they may not be able to make full use of the all-to-all connectivity in two-tier direct networks.…”
Section: Introduction (mentioning)
confidence: 99%