Scheduling Parallel Task Graphs on (Almost) Homogeneous Multicluster Platforms

Dutot, Pierre-François; N'Takpé, Tchimou; Suter, Frédéric; Casanova, Henri

doi:10.1109/tpds.2009.11

Cited by 44 publications

(24 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Weaknesses in both HCPA (and thus in CPA) and M-HEFT were identified and remedied in [22], which performs a thorough comparison of both improved algorithms. In the case of a homogeneous multi-cluster platform, an algorithm to schedule a single PTG was recently proposed in [13]. This algorithm is based on a resource allocation algorithm that provides a performance guarantee, but has high computational complexity.…”

Section: Related Workmentioning

confidence: 99%

On cluster resource allocation for multiple parallel task graphs

Casanova

Desprez

Suter

2010

Journal of Parallel and Distributed Computing

View full text Add to dashboard Cite

a b s t r a c tMany scientific applications can be structured as parallel task graphs (PTGs), that is, graphs of data-parallel tasks. Adding data parallelism to a task-parallel application provides opportunities for higher performance and scalability, but poses additional scheduling challenges. In this paper, we study the off-line scheduling of multiple PTGs on a single, homogeneous cluster. The objective is to optimize performance without compromising fairness among the PTGs. We consider the range of previously proposed scheduling algorithms applicable to this problem, from both the applied and the theoretical literature, and we propose minor improvements when possible. Our main contribution is an extensive evaluation of these algorithms in simulation, using both synthetic and real-world application configurations, using two different metrics for performance and one metric for fairness. We identify a handful of algorithms that provide good trade-offs when considering all these metrics. The best algorithm overall is one that structures the schedule as a sequence of phases of increasing duration based on a makespan guarantee produced by an approximation algorithm.

show abstract

Section: Related Workmentioning

confidence: 99%

On cluster resource allocation for multiple parallel task graphs

Casanova

Desprez

Suter

2010

Journal of Parallel and Distributed Computing

View full text Add to dashboard Cite

show abstract

“…This result was improved in [11], leading to a ∼4.73 performance ratio in the general case. The algorithm proposed in [10] was implemented and compared to HCPA in [12]. It was shown that non-guaranteed algorithms were competitive with the guaranteed one on the average but with tremendously shorter scheduling times.…”

Section: Related Workmentioning

confidence: 99%

“…Note that this inner loop actually corresponds to an interval of iterations of the seminal allocation procedure of CPA, as shown in Figure 1. Each time T CP ≤ T ′ A , the current allocation is stored for each task (lines [11][12][13]. At the end of this procedure, P different allocations are associated with each task in the PTG.…”

Section: B the Bicpa Algorithmmentioning

confidence: 99%

A Bi-criteria Algorithm for Scheduling Parallel Task Graphs on Clusters

Desprez

Suter

2010

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing

View full text Add to dashboard Cite

Applications structured as parallel task graphs exhibit both data and task parallelism, and arise in many domains. Scheduling these applications on parallel platforms has been a long-standing challenge. In the case of a single homogeneous cluster, most of the existing algorithms focus on the reduction of the application completion time (makespan). But in presence of resource managers such as batch schedulers and due to accentuated pressure on energy concerns, the produced schedules also have to be efficient in terms of resource usage. In this paper we propose a novel bi-criteria algorithm, called biCPA, able to optimize these two performance metrics either simultaneously or separately. Using simulation over a wide range of experimental scenarios, we find that biCPA leads to better results than previously published algorithms.

show abstract

“…Research in this area has not considered such parallel tasks as part of a workflow containing task dependencies. For workflow scheduling, existing solutions restrict the execution of parallel tasks to a single cluster [8] [3]. This is due to the support of high interconnection speed, high bandwidth and low latency networks such as Infiniband [9], and the support of Network File System (NFS) which provides easy and transparent access to data from any of the parallel process.…”

Section: Introductionmentioning

confidence: 99%

Scheduling Workflows in Multi-cluster Environments

Stanzani

Sato

Netto

2013

2013 27th International Conference on Advanced Information Networking and Applications Workshops

View full text Add to dashboard Cite

Scientific applications modeled as workflows can exhibit both task and data parallelism. Scheduling these workflows in a multi-cluster environment is challenging due to the large number of task mapping possibilities. Therefore, several heuristics have been proposed over the last years to address such a problem. A key limitation of existing heuristics for multi-cluster environments is that individual tasks are mapped onto single resources, which limits the resource options to reduce the time to the complete workflow executions. This paper introduces the Multi-Cluster Allocation-Heterogeneous Earliest Finish Time (MCA-HEFT) heuristic, which deploys single parallel tasks of a workflow into multiple clusters and schedules them accordingly. We evaluated MCA-HEFT against the Mixed-parallel Heterogeneous Earliest Finish Time (M-HEFT) heuristic, which is one of the most well-known workflow scheduling heuristics in literature. MCA-HEFT was able to produce makespans that were up to 42% shorter than those produced by M-HEFT, having only approximately 10% of tasks distributed on multiple clusters. Our experiments considered several metrics and parameters including critical path size, makespan, number of clusters used to execute tasks, and the network impact when deploying the tasks in multiple clusters.

show abstract

Scheduling Parallel Task Graphs on (Almost) Homogeneous Multicluster Platforms

Cited by 44 publications

References 26 publications

On cluster resource allocation for multiple parallel task graphs

On cluster resource allocation for multiple parallel task graphs

A Bi-criteria Algorithm for Scheduling Parallel Task Graphs on Clusters

Scheduling Workflows in Multi-cluster Environments

Contact Info

Product

Resources

About