2006 International Conference on Parallel Processing (ICPP'06)
DOI: 10.1109/icpp.2006.22
|View full text |Cite
|
Sign up to set email alerts
|

An Integrated Approach for Processor Allocation and Scheduling of Mixed-Parallel Applications

Abstract: Computationally complex applications can often be viewed as a collection of coarse-grained data-parallel tasks with precedence constraints. Researchers have shown that combining task and data parallelism (mixed parallelism) can be an effective approach for executing these applications, as compared to pure task or data parallelism. In this paper, we present an approach to determine the appropriate mix of task and data parallelism, i.e., the set of tasks that should be run concurrently and the number of processo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
28
0

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(28 citation statements)
references
References 19 publications
0
28
0
Order By: Relevance
“…Several practical PTG scheduling algorithms based on heuristics have been proposed in the literature [6], [8], [9], [10], [11]. Like the guaranteed algorithms discussed earlier, the algorithms in [6], [8], [9], [10] proceed in two phases.…”
Section: Single Homogeneous Clustermentioning
confidence: 99%
See 2 more Smart Citations
“…Several practical PTG scheduling algorithms based on heuristics have been proposed in the literature [6], [8], [9], [10], [11]. Like the guaranteed algorithms discussed earlier, the algorithms in [6], [8], [9], [10] proceed in two phases.…”
Section: Single Homogeneous Clustermentioning
confidence: 99%
“…These last three algorithms all use a list-scheduling-based task mapping phase by which tasks are mapped to processors in order of decreasing "bottom level" (i.e., distance to the PTG's exit task), accounting for data communication and data redistribution costs. The iCASLB one-step algorithm in [11] was shown to lead to better performance than some two-step algorithms, including CPA, while maintaining reasonable complexity. This algorithm performs allocation and mapping simultaneously by iteratively increasing the allocations of tasks on the critical path, with a look-ahead mechanism to avoid being trapped in local minima, and a backfilling approach to improve the schedule.…”
Section: Single Homogeneous Clustermentioning
confidence: 99%
See 1 more Smart Citation
“…Then overlap j1, j2 is extracted as shown in Table 1. Since Tile 1 and Tile 5 are overlapped, overlap 1,5 and overlap 5,1 take 1. On the other hand, overlap 1,6 and overlap 6,1 take 0 because Tile 1 and Tile 6 do not share any cores.…”
Section: A Greedy Algorithmmentioning
confidence: 99%
“…In other words, an application is assigned a single core. Techniques presented in [2]- [5] take into account data parallelism within applications (intra-application parallelism) as well as application-level parallelism (interapplication parallelism). Their methods perform scheduling and mapping simultaneously, aiming at minimization of schedule length or pipeline throughput.…”
Section: Related Workmentioning
confidence: 99%