2018
DOI: 10.3390/mca23020028
|View full text |Cite
|
Sign up to set email alerts
|

The Impact of the Implementation Cost of Replication in Data Grid Job Scheduling

Abstract: Data Grids deal with geographically-distributed large-scale data-intensive applications. Schemes scheduled for data grids attempt to not only improve data access time, but also aim to improve the ratio of data availability to a node, where the data requests are generated. Data replication techniques manage large data by storing a number of data files efficiently. In this paper, we propose centralized dynamic scheduling strategy-replica placement strategies (CDSS-RPS). CDSS-RPS schedule the data and task so tha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 49 publications
0
3
0
Order By: Relevance
“…The PRS algorithm considers the number of accesses and storage capacity of computing nodes for replica placement. 18 A series of previous works propose replication and scheduling algorithms for specific architectures. Hierarchical architecture is trendy among researchers.…”
Section: Combined Job Scheduling and Data Replication Algorithmsmentioning
confidence: 99%
See 2 more Smart Citations
“…The PRS algorithm considers the number of accesses and storage capacity of computing nodes for replica placement. 18 A series of previous works propose replication and scheduling algorithms for specific architectures. Hierarchical architecture is trendy among researchers.…”
Section: Combined Job Scheduling and Data Replication Algorithmsmentioning
confidence: 99%
“…Parameter RFC is obtained from Equation (17) by adding the time taken to transfer the file and the waiting time in the storage disk queue. The DTC in Equation 17is the data transfer cost which is given by Equation (18) where S f is the size of requested file f, band i,k is the bandwidth of the connection links between requesting site i and replica site k, DS k is the disk speed of site k, and lat i,k is the latency between the links caused by different factors like the distance and traffic of routers. The latency is estimated by the time taken to transfer an empty packet from site i to the site k. DWQ k is the waiting time in the storage disk queue of the site k, and is calculated using Equation (19) where S fz is the size of the requested file waiting in the storage queue of site k, and n is the number of file-access requests waiting in the storage disk queue.…”
Section: Replica Selectionmentioning
confidence: 99%
See 1 more Smart Citation
“…The outcome exhibits satisfactory predictive accuracy with sufficient optimization of channel capacity. The study towards cost estimation is carried out by Nazir et al [20] while performing job scheduling over grid system. The work introduces a unique dynamic scheduling policy that is centralized towards replica placement.…”
Section: Introductionmentioning
confidence: 99%