Proceedings of 8th International Parallel Processing Symposium
DOI: 10.1109/ipps.1994.288208

CCL: a portable and tunable collective communication library for scalable parallel computers

Cited by 37 publications (12 citation statements)
References 22 publications

“…Several algorithms for improving the performance of collective communications have been proposed for decades [1], [2], [3], [4], [5]. More recently, some researchers have focused their efforts on taking advantage of existing algorithms in order to find techniques for selecting the most efficient algorithm for any given system/workload configuration.…”
Section: Related Work (mentioning)
confidence: 99%
“…Figure 7 measures the average skew per frame in milliseconds. The total skew is calculated by taking the square root of the sum of the square of the differences between retrieval times of items from the two consumers. Clearly channel groups lead to significantly lower skew and, although these results are somewhat obvious, the measurements simply provide a compact characterization of the performance of channel groups and show channel groups do provide viable synchronization in a realistic application scenario.…”
Section: Preliminary Performance (mentioning)
confidence: 99%
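
The skew calculation described in the statement above can be sketched as follows. This is a minimal illustration, not the citing authors' code: the function name `total_skew` and the assumption that the two consumers' retrieval times arrive as equal-length, index-aligned lists are hypothetical.

```python
import math


def total_skew(times_a, times_b):
    """Square root of the sum of squared differences between the
    retrieval times of corresponding items at two consumers.

    Assumption: times_a[i] and times_b[i] refer to the same item,
    expressed in the same unit (e.g. milliseconds).
    """
    if len(times_a) != len(times_b):
        raise ValueError("both consumers must report the same number of items")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(times_a, times_b)))
```

The statement does not say how the per-frame average is derived from this total, so that step is left out of the sketch.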
“…Indirect algorithms for collective communication have also been addressed in [1]. Here, interprocessor data communication is performed in a "combine and forward" manner.…”
Section: Cyclic(x) (mentioning)
confidence: 99%
“…The communication overheads can be represented using an analytical model of typical distributed memory machines, the General purpose Distributed Memory (GDM) model [24]. Similar models are reported in the literature [1], [3], [4]. The GDM model represents the communication time of a message passing operation using two parameters: the start-up time T_d and the unit data transmission time τ_d.…”
Section: The Cost Of Redistribution (mentioning)
confidence: 99%
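
A two-parameter cost model of this kind is commonly written as a start-up term plus a per-unit term. The sketch below assumes that linear form, which the quoted statement does not spell out; the function and parameter names are hypothetical and only the two parameters (start-up time and unit data transmission time) come from the statement.

```python
def message_time(units, startup_time, unit_time):
    """Estimated time to send one message of `units` data units.

    Assumed linear form: startup_time + units * unit_time, where
    startup_time plays the role of T_d and unit_time the role of tau_d.
    """
    if units < 0:
        raise ValueError("units must be non-negative")
    return startup_time + units * unit_time
```

For example, with a start-up time of 50.0 and a unit time of 0.5 in the same time unit, `message_time(100, 50.0, 0.5)` evaluates to 100.0, so start-up cost accounts for half of the total at that message size.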