18th International Parallel and Distributed Processing Symposium, 2004. Proceedings.
DOI: 10.1109/ipdps.2004.1303051
|View full text |Cite
|
Sign up to set email alerts
|

Data partitioning with a realistic performance model of networks of heterogeneous computers

Abstract: The paper presents a performance model of a network of heterogeneous computers that takes account of the heterogeneity of memory structure and other architectural differences. Under this model, the speed of each processor is represented by a function of the size of the problem whereas standard models use single numbers to represent the speeds of the processors. We prove that this model is more realistic than the standard ones when the network includes computers with significantly different memory structure. We… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
51
0

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 33 publications
(51 citation statements)
references
References 3 publications
0
51
0
Order By: Relevance
“…Figure 5 (a) shows the speedup of the matrix-matrix multiplication executed on this network using the advanced model over the matrix-matrix multiplication using the modified version of the standard model that determines the speed of the processor based on the multiplication of two dense 500×500 matrices and two dense 4000×4000 matrices. For problem sizes beyond 24000, the figure shows that the distribution given by the performance model [1] will result in failure of the application. For these problem sizes, the modified performance model is used to obtain optimal distribution.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…Figure 5 (a) shows the speedup of the matrix-matrix multiplication executed on this network using the advanced model over the matrix-matrix multiplication using the modified version of the standard model that determines the speed of the processor based on the multiplication of two dense 500×500 matrices and two dense 4000×4000 matrices. For problem sizes beyond 24000, the figure shows that the distribution given by the performance model [1] will result in failure of the application. For these problem sizes, the modified performance model is used to obtain optimal distribution.…”
Section: Resultsmentioning
confidence: 99%
“…Some of the issues with programming applications on such networks of heterogeneous computers have been explained in [1]. These are mainly:…”
Section: Indexmentioning
confidence: 99%
See 3 more Smart Citations