2009
DOI: 10.1016/j.peva.2008.11.003

Numerical computation algorithms for sequential checkpoint placement

Cited by 19 publications
(12 citation statements)
References 34 publications
“…Different from the previous [8] and [10][11][12], the proposed mathematical analytical model is independent of the explicit expression of F(t). In other words, the availability of the proposed checkpoint scheduling algorithm cannot be affected by the variety of the failure rate r(t) = F(t)…”
Section: Discussion (citation type: mentioning)
confidence: 99%
“…According to [9], it can be concluded that a constant CI is optimal on condition that the system fault follows Poisson/exponential process. For the particular PF 2 failure distribution, the non-increasing CI sequence can be performed in [10][11][12]. For large-scale HPC system, Liu presented the reliability-aware method for optimal checkpoint/restart strategy to minimize rollback recovery and checkpointing overheads [13,14].…”
Section: Introduction (citation type: mentioning)
confidence: 99%
“…Conversely, if CPs are seldom placed, a larger RB overhead after a system failure will be required. Hence, it is important to determine the optimal CP interval taking account of the trade-off between two kinds of overhead factors above [4], [5], [16]. Gelenbe et al.…”
Section: Introduction (citation type: mentioning)
confidence: 99%
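
The trade-off described in the statement above can be illustrated with a minimal sketch. It is not the method of the cited paper: it assumes a constant failure rate (exponential failures) and uses the classical first-order approximation commonly attributed to Young, in which the overhead per unit time is roughly C/T + T/(2M) for checkpoint cost C, checkpoint interval T, and mean time between failures M. The function names and the sample values are hypothetical.

```python
import math


def overhead_rate(interval, checkpoint_cost, mtbf):
    """First-order overhead per unit time (an approximation, assumed here).

    checkpoint_cost / interval : time spent writing checkpoints
    interval / (2 * mtbf)      : expected rework (rollback) after a failure
    """
    return checkpoint_cost / interval + interval / (2.0 * mtbf)


def young_interval(checkpoint_cost, mtbf):
    """Interval minimizing overhead_rate: T = sqrt(2 * C * M)."""
    return math.sqrt(2.0 * checkpoint_cost * mtbf)


if __name__ == "__main__":
    # Hypothetical values: 60 s per checkpoint, one failure per day on average.
    c, m = 60.0, 86400.0
    t_opt = young_interval(c, m)
    for t in (t_opt / 4, t_opt, 4 * t_opt):
        print(f"interval={t:9.1f} s  overhead={overhead_rate(t, c, m):.4f}")
```

Checkpointing too often inflates the first term and checkpointing too rarely inflates the second, which is the balance the citing authors refer to. For non-constant failure rates the optimal sequence is generally not equally spaced, so this sketch does not apply there.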
“…Bouguerra et al [2] also give an analytical model with coordinated CP/RB for a large scale cluster system. It is worth mentioning that the above works are based on the direct application of the similar analytical techniques to the CP placement for coherent computer systems [4], [5], [16]. However, the above works did not consider the possibility of occurrence of multi-node failure.…”
Section: Introduction (citation type: mentioning)
confidence: 99%