1994
DOI: 10.1007/bf02577741
|View full text |Cite
|
Sign up to set email alerts
|

Fast, contention-free combining tree barriers for shared-memory multiprocessors

Abstract: Counter-based algorithms for busy-wait barrier synchronization execute in time linear in the number of synchronizing processes. This time can be made logarithmic in the number of processes by adopting algorithms based on trees or FFT-like synchronization patterns. As an additional improvement, Gupta and Hill [5] have proposed an adaptive combining tree barrier that exploits non-uniformity in inter-barrier computation times: processes begin to leave the barrier in time logarithmic in the number of processes whe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2004
2004
2019
2019

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 35 publications
(12 citation statements)
references
References 22 publications
0
12
0
Order By: Relevance
“…Thus, many research groups have provided machineoptimized implementations and have investigated the optimal and non-trivial implementation of collective algorithms for particular machine architectures (cf. [7,13,19,34,41,43,44]). …”
Section: Introductionmentioning
confidence: 99%
“…Thus, many research groups have provided machineoptimized implementations and have investigated the optimal and non-trivial implementation of collective algorithms for particular machine architectures (cf. [7,13,19,34,41,43,44]). …”
Section: Introductionmentioning
confidence: 99%
“…When a task with both signal and wait capabilities on ph performs multiple signal operations on ph without performing a wait operation, only the first signal operation is valid and the others become no-op. This semantics intends to reduce the complexity of supporting fuzzy [110] or splitphase [111] barrier that allows local work to be performed between the signal and wait/next operations. On the other hand, multiple wait operations by such a task result in an error (PhaserException at runtime) so as to avoid deadlock.…”
Section: At This Point T I Is the Only Task Registered On Phmentioning
confidence: 99%
“…The literature convincingly shows the benefits of the advanced barrier and phaser concepts [3]- [6], [8], [10]. Thus, we will primarily focus on the actual synchronization overhead.…”
Section: Performance Evaluationmentioning
confidence: 99%