1997
DOI: 10.1006/jpdc.1997.1351
|View full text |Cite
|
Sign up to set email alerts
|

Fast Runtime Block Cyclic Data Redistribution on Multiprocessors

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
35
0
2

Year Published

1999
1999
2014
2014

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 45 publications
(37 citation statements)
references
References 15 publications
0
35
0
2
Order By: Relevance
“…In many applications, data redistribution may be needed if elements of a data set are inserted or deleted at the end of the array. In particular, algorithms to redistribute data using a new block size exist for the Block Cyclic distribution [14,19]. If an application uses a dynamic data set with elements that are appended, a Cyclic or Block Cyclic distribution is superior to Block because new elements are added to the locale that follows the cyclic or block-cyclic pattern.…”
Section: Chapel's Data Distributionsmentioning
confidence: 99%
“…In many applications, data redistribution may be needed if elements of a data set are inserted or deleted at the end of the array. In particular, algorithms to redistribute data using a new block size exist for the Block Cyclic distribution [14,19]. If an application uses a dynamic data set with elements that are appended, a Cyclic or Block Cyclic distribution is superior to Block because new elements are added to the locale that follows the cyclic or block-cyclic pattern.…”
Section: Chapel's Data Distributionsmentioning
confidence: 99%
“…the computation-to-communication ratio. Because this granularity changes from one computational kernel to the other, moving from a CYCLIC(r) distribution over p processors to a CYCLIC(s) distribution over q processors is a very useful redistribution procedure, which has been implemented using a caterpillar algorithm in ScaLAPACK [34]. Several papers, including [23,39,14,33,19,11,24], have dealt with various optimizations of this redistribution procedure.…”
Section: Related Workmentioning
confidence: 99%
“…Based on the intersections, the send/receive processor/data sets can be determined and general redistribution algorithms can be devised. Prylli and Touranchean [21] proposed a runtime scan algorithm for BLOCK-CYCLIC array redistribution. Their approach has the same time complexity as that proposed in [23] but has a simple basic operation compared to that proposed in [23].…”
Section: Related Workmentioning
confidence: 99%
“…In some algorithms, such as multidimensional fast Fourier transform [29], the Alternative Direction Implicit (ADI) method for solving two-dimensional diffusion equations, and linear algebra solvers [21], an array distribution that is well suited for one phase may not be good for a subsequent phase in terms of performance. Array redistribution is required for those algorithms at runtime.…”
Section: Introductionmentioning
confidence: 99%