2013 IEEE 9th International Conference on E-Science 2013
DOI: 10.1109/escience.2013.34
|View full text |Cite
|
Sign up to set email alerts
|

An Algorithm for Cost-Effectively Storing Scientific Datasets with Multiple Service Providers in the Cloud

Abstract: Cloud computing provides scientists a platform that can deploy computation and data intensive applications without infrastructure investment. With excessive cloud resources and a decision support system, large generated datasets can be flexibly 1) stored locally in the current cloud, 2) deleted and regenerated whenever reused or 3) transferred to cheaper cloud service for storage. However, due to the pay-as-you-go model, the total application cost largely depends on the usage of computation, storage and bandwi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0
3

Year Published

2014
2014
2020
2020

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 15 publications
(9 citation statements)
references
References 32 publications
0
6
0
3
Order By: Relevance
“…Based on the optimised placement of data, effective and efficient task scheduling approaches can be derived [62] [63] [64] [64]. Furthermore, as numerous cloud services have been published on Internet, in order to reduce the data storage cost, less frequently used data can be moved to cheap cloud services for storage [65] [66].…”
Section: ) Methodologymentioning
confidence: 99%
“…Based on the optimised placement of data, effective and efficient task scheduling approaches can be derived [62] [63] [64] [64]. Furthermore, as numerous cloud services have been published on Internet, in order to reduce the data storage cost, less frequently used data can be moved to cheap cloud services for storage [65] [66].…”
Section: ) Methodologymentioning
confidence: 99%
“…In real world scientific applications (e.g., Montage [30], SIPHT [31], Pulsar Searching [32] [33], FEM [34] and Epigenomics [35], etc. ), the size of datasets varies from tens of megabytes to tens of gigabytes, and number of tasks varies from tens to thousands.…”
Section: Environment Setup and Parameter Settingmentioning
confidence: 99%
“…1, an enterprise-level cloud cache can be deployed in a multi-provider cloud environment where a consumer enterprise employs multiple concurrent cloud providers to host its external private clouds that are either independent from each other [13,14] or interconnected as a hybrid cloud [15]; both are the forms of federated cloud [16,17]. There are several benefits of multi-provider cloud deployment such as load balancing [18,19], enabling of planned downtime for system maintenance [17], confidentiality protection [20], risk mitigation (mandated by hospitals, stock markets, air transportation controls, etc.) [20,21] and the utilization of unique capabilities offered by different cloud providers [13].…”
Section: Introductionmentioning
confidence: 99%