2007
DOI: 10.1016/j.future.2006.04.009
|View full text |Cite
|
Sign up to set email alerts
|

Mining performance data for metascheduling decision support in the Grid

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
33
0

Year Published

2007
2007
2023
2023

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 28 publications
(33 citation statements)
references
References 10 publications
0
33
0
Order By: Relevance
“…Predictions of performance metrics, such as application run times and queue wait times on clusters, serve as important information for scheduling decision making at the Grid level. The main patterns that we identify for data-intensive clusters, namely periodicity, long range dependence, and temporal locality, suggest that prediction techniques based on historical data modeling would most likely work on real production systems [21,15]. The Grid-level scheduling strategies can also take advantages of specific VO job arrival patterns.…”
Section: Modeling and Predictionsmentioning
confidence: 91%
“…Predictions of performance metrics, such as application run times and queue wait times on clusters, serve as important information for scheduling decision making at the Grid level. The main patterns that we identify for data-intensive clusters, namely periodicity, long range dependence, and temporal locality, suggest that prediction techniques based on historical data modeling would most likely work on real production systems [21,15]. The Grid-level scheduling strategies can also take advantages of specific VO job arrival patterns.…”
Section: Modeling and Predictionsmentioning
confidence: 91%
“…Li et al [28] present an Instance Based Learning technique to forecast response times of jobs in grids by means of historical performance data mining. This approach is based on the definition of similarity between jobs.…”
Section: B Multi-stage Predictor Evaluationmentioning
confidence: 99%
“…Weighted Average (WA) and Locally Weighted Linear Regression (LLWR) are used as the candidate induction models for predictions. We refer to [9] for details and formulations of the basic prediction algorithm.…”
Section: The Basic Prediction Algorithmmentioning
confidence: 99%
“…The sequential search is relatively slow as it has to calculate distances with all entries in the history base. Since it involves resource state attributes, the distance calculations for queue wait times are much more expensive and it cannot employ caching like run times without compromising accuracy [9]. To improve performance a different access structure is needed and we investigate M-Tree in this context.…”
Section: Nearest Neighbor Searchmentioning
confidence: 99%