2007
DOI: 10.1177/1094342006074864
|View full text |Cite
|
Sign up to set email alerts
|

Data Partitioning with a Functional Performance Model of Heterogeneous Processors

Abstract: In this paper, we address the problem of optimal distribution of computational tasks on a network of heterogeneous computers when one or more tasks do not fit into the main memory of the processors and when relative speeds vary with the problem size. We propose a functional performance model of heterogeneous processors that integrates many essential features of a network of heterogeneous computers having a major impact on its performance such as the processor heterogeneity, the heterogeneity of memory structur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
93
0

Year Published

2009
2009
2016
2016

Publication Types

Select...
3
3
2

Relationship

5
3

Authors

Journals

citations
Cited by 64 publications
(93 citation statements)
references
References 18 publications
0
93
0
Order By: Relevance
“…A set of p curves on this plane (as shown in Figure 3 (b)) will represent the absolute speeds of the processors against variable y given parameter x is fixed; • Apply the set partitioning algorithm [4] to this set of p curves to obtain an optimal distribution.…”
Section: Data Partitioning Algorithm (Dpa-fpm-2d)mentioning
confidence: 99%
See 1 more Smart Citation
“…A set of p curves on this plane (as shown in Figure 3 (b)) will represent the absolute speeds of the processors against variable y given parameter x is fixed; • Apply the set partitioning algorithm [4] to this set of p curves to obtain an optimal distribution.…”
Section: Data Partitioning Algorithm (Dpa-fpm-2d)mentioning
confidence: 99%
“…The problem of distributing independent chunks of computations over a onedimensional arrangement of heterogeneous processors using this FPM has been studied in [4]. It can be formulated as follows: Given n independent chunks of computations, each of equal size (i.e., each requiring the same amount of work), how can we assign these chunks to p (p<n) physical processors P 1 , P 2 , ..., P p with their respective full FPMs represented by speed functions s 1 (x), s 2 (x), ..., s p (x) so that the workload is best balanced?…”
Section: Introductionmentioning
confidence: 99%
“…It has been shown in [13] that it is more accurate to represent performance as a function of problem size, which reflects contributions from both processor and memory. In this paper, we propose a new dynamic load balancing algorithm based on partial functional performance models of processors [3].…”
Section: Related Workmentioning
confidence: 99%
“…Our dynamic load balancing algorithm is based on functional performance models [13], which are application centric and hardware specific. Functional performance models reflect both processor and memory heterogeneity.…”
mentioning
confidence: 99%
“…This type of parallel application is often used in practice, for example, in processing of a large amount of image data collected from the hyperspectral sensors on airborne/satellite platforms (Plaza et al 2006). Our application multiplies two dense square matrices, C = A × B, and employs a simple heterogeneous parallel algorithm based on one-dimensional matrix partitioning (see, for example, Lastovetsky and Reddy 2007). As shown in Figure 5, the matrices A and C are horizontally sliced such that the number of elements in a slice is proportional to the speed of the processor owning the slice.…”
Section: N Estimation Of Communication Modelsmentioning
confidence: 99%