2008
DOI: 10.1155/2008/982349
|View full text |Cite
|
Sign up to set email alerts
|

A Low‐Cost Hybrid Coordinated Checkpointing Protocol for Mobile Distributed Systems

Abstract: Mobile distributed systems raise new issues such as mobility, low bandwidth of wireless channels, disconnections, limited battery power and lack of reliable stable storage on mobile nodes. In minimum-process coordinated checkpointing, some processes may not checkpoint for several checkpoint initiations. In the case of a recovery after a fault, such processes may rollback to far earlier checkpointed state and thus may cause greater loss of computation. In all-process coordinated checkpointing, the recovery line… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
39
0

Year Published

2010
2010
2015
2015

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 23 publications
(39 citation statements)
references
References 33 publications
0
39
0
Order By: Relevance
“…Checkpointing overheads [55], [56], [57], [58] have been discussed by many researchers. An integrated checkpointing algorithm implements in parallel with the essential computation.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Checkpointing overheads [55], [56], [57], [58] have been discussed by many researchers. An integrated checkpointing algorithm implements in parallel with the essential computation.…”
Section: Related Workmentioning
confidence: 99%
“…The model described in [56] uses useful work, i.e., computation that contributes to job completion, to measure system performance.…”
Section: Related Workmentioning
confidence: 99%
“…Deterministic Systems: If two processes start in the same state, and both receive the identical sequence of inputs, they will produce the identical sequence outputs and will finish in the same state. The state of a process is thus completely determined by its starting state and by sequence of messages it has received [23, [24], [25]. Checkpoint Interval (CI): The i th CI of a process denotes all the computation performed between its i th and (i+1) th checkpoint, including the i th checkpoint but not the (i+1) th checkpoint.…”
Section: Introduction 11 Definitions and Notationsmentioning
confidence: 99%
“…In order to record a consistent global checkpoint, when a process takes a checkpoint, it asks (by sending checkpoint requests to) all relevant processes to take checkpoints. Therefore, coordinated checkpointing suffers from high overhead associated with the checkpointing process [20], [21], [22], [23]. Much of the previous work [2,3,4,20,21,22,23] in coordinated checkpointing has focused on minimizing the number of synchronization messages and the number of checkpoints during the checkpointing process.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, coordinated checkpointing suffers from high overhead associated with the checkpointing process [20], [21], [22], [23]. Much of the previous work [2,3,4,20,21,22,23] in coordinated checkpointing has focused on minimizing the number of synchronization messages and the number of checkpoints during the checkpointing process. However, some algorithms (called blocking algorithm) force all relevant processes in the system to block their computations during the checkpointing process [3,9,21,22,23].…”
Section: Introductionmentioning
confidence: 99%