2006
DOI: 10.1016/j.jpdc.2005.10.008

CEFT: A cost-effective, fault-tolerant parallel virtual file system

Abstract: The vulnerability of computer nodes due to component failures is a critical issue for cluster-based file systems. This paper studies the development and deployment of mirroring in cluster-based parallel virtual file systems to provide fault tolerance and analyzes the tradeoffs between the performance and the reliability in the mirroring scheme. It presents the design and implementation of CEFT, a scalable RAID-10 style file system based on PVFS, and proposes four novel mirroring protocols depending on whether …

Cited by 11 publications (9 citation statements)
References 60 publications
“…This paper extends our previous studies presented in [13,14,16] and incorporates more experiments to evaluate our proposed approach. Based on the experimental results collected from a real cluster in production mode, this paper helps shed light on the following important design and performance issues: (1) What is the impact of resource contention on the aggregate storage throughput?…”
supporting
confidence: 55%
“…As data throughput is the most important objective of PVFS, some expensive but indispensable functions such as the concurrent control between data and metadata are not fully designed and implemented. In CEFT [6], [10], [13], [17], which is an extension of PVFS to incorporate a RAID-10-style fault tolerance and parallel I/O scheduling, the MS synchronizes concurrent updates, which can limit the overall throughput under the workload of intensive concurrent metadata updates. In Lustre [1], some low-level metadata management tasks are offloaded from the MS to object storage devices, and ongoing efforts are being made to decentralize metadata management to further improve the scalability.…”
Section: Related Work and Comparison Of Decentralization Schemes
mentioning
confidence: 99%
“…Rapid advances in general-purpose communication networks have motivated the deployment of inexpensive components to build competitive cluster-based storage solutions to meet the increasing demand of scalable computing [1], [2], [3], [4], [5], [6]. In the recent years, the bandwidth of these networks has been increased by two orders of magnitude [7], [8], [9], which greatly narrows the performance gap between them and the dedicated networks used in commercial storage systems.…”
Section: Introduction
mentioning
confidence: 99%
“…Scientific applications usually need to input and output large amounts of data from secondary storage systems [5]. In order to alleviate the I/O bottleneck, cluster supercomputers usually use high-end storage servers with large capacity of main memory.…”
Section: Introduction
mentioning
confidence: 99%