Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing 2013
DOI: 10.1145/2462902.2462908
|View full text |Cite
|
Sign up to set email alerts
|

A 1 PB/s file system to checkpoint three million MPI tasks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 16 publications
(2 citation statements)
references
References 19 publications
0
2
0
Order By: Relevance
“…In addition, with the increasing scale of applications, they need to write larger checkpoint data more frequently (Sato et al, 2014). This generates an enormous amount of write traffic to storage systems (Rajachandrasekar et al, 2013). Much work (Ali et al, 2009;Dongarra, 2010;Shalf et al, 2010;Bent et al, 2012;Lofstead et al, 2016) has shown that current HDD-based storage systems have been stretched to their limits in handling the tremendous amount of I/O.…”
Section: Introductionmentioning
confidence: 99%
“…In addition, with the increasing scale of applications, they need to write larger checkpoint data more frequently (Sato et al, 2014). This generates an enormous amount of write traffic to storage systems (Rajachandrasekar et al, 2013). Much work (Ali et al, 2009;Dongarra, 2010;Shalf et al, 2010;Bent et al, 2012;Lofstead et al, 2016) has shown that current HDD-based storage systems have been stretched to their limits in handling the tremendous amount of I/O.…”
Section: Introductionmentioning
confidence: 99%
“…My primary task is to improve the performance of UnifyFS in checkpoint restarts. For this purpose, I will delve into checkpointing [16] technology and fine-tune its generation, storage, and recovery processes [17] , aiming to enhance the reliability and efficiency of the entire system. Simultaneously, I plan to integrate the Veloc tool into UnifyFS, which will enhance the system's capabilities in data management and recovery, and offer more flexible and efficient solutions through data snapshots and version control.…”
Section: Discussionmentioning
confidence: 99%