2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (Ccgrid 2012) 2012
DOI: 10.1109/ccgrid.2012.109
|View full text |Cite
|
Sign up to set email alerts
|

A Workflow-Aware Storage System: An Opportunity Study

Abstract: -This paper evaluates the potential gains a workflow-aware storage system can bring. Two observations make us believe such storage system is crucial to efficiently support workflow-based applications: First, workflows generate irregular and application-dependent data access patterns. These patterns render existing storage systems unable to harness all optimization opportunities as this often requires conflicting optimization options or even conflicting design decision at the level of the storage system. Second… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
5
4

Relationship

4
5

Authors

Journals

citations
Cited by 24 publications
(18 citation statements)
references
References 22 publications
0
18
0
Order By: Relevance
“…The details of workflow enactment depends on the precise data access semantics and API offered by this intermediate space. At one end of the spectrum are intermediate storage systems that offer a shared file-system with complete POSIX API support (e.g., MosaStore [1,21], and Chirp/Parrot [20]). Whereas, at the other end of the spectrum are two overlapping solutions: (1) solutions that offer a custom API, possibly specialized for classes of applications (e.g., the Hadoop File System -HDFS); and, (2) solutions that do not offer a shared name and storage space and leave explicit data movement between independent storage nodes and space management to be handled by the workflow scheduler.…”
Section: Storage Systemsmentioning
confidence: 99%
“…The details of workflow enactment depends on the precise data access semantics and API offered by this intermediate space. At one end of the spectrum are intermediate storage systems that offer a shared file-system with complete POSIX API support (e.g., MosaStore [1,21], and Chirp/Parrot [20]). Whereas, at the other end of the spectrum are two overlapping solutions: (1) solutions that offer a custom API, possibly specialized for classes of applications (e.g., the Hadoop File System -HDFS); and, (2) solutions that do not offer a shared name and storage space and leave explicit data movement between independent storage nodes and space management to be handled by the workflow scheduler.…”
Section: Storage Systemsmentioning
confidence: 99%
“…HDFS [6] exposes data locality through a customized remote procedure call to the Hadoop execution engine. MosaStore [1,30] takes the approach of embedding data locality in the extended attributes of the POSIX standard. Both approaches require that the execution engine process locality information in order to make it transparent to programmers.…”
Section: Related Workmentioning
confidence: 99%
“…Replica synchronization is done in the MDS of MosaStore. The Gfarm file system [10] the replication mechanism is used for data replication for the reliability and availability. In the distributed and parallel file system, the MDS controls the data replication and send the data to the storage servers; this makes pressure to the MDS.…”
Section: Related Workmentioning
confidence: 99%