Proceedings of IEEE 27th International Symposium on Fault Tolerant Computing
DOI: 10.1109/ftcs.1997.614079

A communication-induced checkpointing protocol that ensures rollback-dependency trackability

Abstract: Considering an application in which processes take local checkpoints independently (called basic checkpoints), this paper develops a protocol that forces them to take some additional local checkpoints (called forced checkpoints) in order that the resulting checkpoint and communication pattern satisfies the Rollback Dependency Trackability (RDT) property. This property states that all dependencies between local checkpoints are on-line trackable by using a transitive dependency vector. Compared to other protocols…
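To make the idea of forced checkpoints and a transitive dependency vector concrete, here is a minimal Python sketch. It does not reproduce the protocol developed in this paper, whose details are truncated in the abstract above. It uses the simpler and more conservative Fixed-Dependency-After-Send (FDAS) rule instead, which is also known to ensure RDT: once a process has sent a message in its current checkpoint interval, it takes a forced checkpoint before delivering any message that would change its dependency vector. The `Process` class and its field and method names are hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Process:
    """Hypothetical process model; FDAS rule, not the paper's protocol."""
    pid: int
    n: int                                   # number of processes (assumed fixed)
    dv: list = field(default_factory=list)   # transitive dependency vector
    sent: bool = False                       # sent a message in this interval?
    basic: int = 0                           # count of basic checkpoints
    forced: int = 0                          # count of forced checkpoints

    def __post_init__(self):
        # dv[pid] is the index of the current checkpoint interval;
        # dv[j] is the highest interval of process j this state depends on.
        self.dv = [0] * self.n
        self.dv[self.pid] = 1

    def _checkpoint(self):
        # Closing the current interval opens a new one.
        self.dv[self.pid] += 1
        self.sent = False

    def basic_checkpoint(self):
        # Taken independently, at the application's discretion.
        self.basic += 1
        self._checkpoint()

    def send(self):
        # The dependency vector is piggybacked on every message.
        self.sent = True
        return list(self.dv)

    def receive(self, msg_dv):
        # FDAS: after a send, the vector must stay fixed until the next
        # checkpoint, so a message carrying a new dependency forces one.
        brings_new = any(m > d for m, d in zip(msg_dv, self.dv))
        if self.sent and brings_new:
            self.forced += 1
            self._checkpoint()
        # Merge dependencies with a component-wise maximum.
        self.dv = [max(m, d) for m, d in zip(msg_dv, self.dv)]


# Small scenario with three processes.
p0, p1, p2 = (Process(i, 3) for i in range(3))
p1.receive(p0.send())   # p1 has not sent yet: delivered without a checkpoint
p1.send()               # p1 now has a send in its current interval
p0.basic_checkpoint()
p1.receive(p0.send())   # new dependency after a send: forced checkpoint
print(p1.forced, p1.dv)  # prints: 1 [2, 2, 0]
```

FDAS tends to take more forced checkpoints than strictly necessary. The abstract's closing comparison with other protocols is truncated here, so the sketch should be read only as a baseline illustration of the mechanism.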

Cited by 75 publications (83 citation statements)
References: 12 publications
“…Communication-induced checkpointing avoids the domino-effect without requiring all checkpoints to be coordinated [12], [33], [55]. In these protocols, processes take two kinds of checkpoints, local and forced.…”
Section: Quasi-synchronous or Communication Induced Checkpointing (mentioning)
confidence: 99%
“…Besides these two fundamental approaches there is another approach known as communication induced check pointing approach (J. Tsai et al, 1998; R. Baldoni et al, 1997; J.…”
Section: Introduction (mentioning)
confidence: 99%
“…In the case of a fault, processes rollback to the last checkpointed state. Communication-induced Checkpointing: It avoids the domino-effect without requiring all checkpoints to be coordinated [2], [7], [9]. In these protocols, processes take two kinds of checkpoints, local and forced.…”
Section: Introduction 11 Definitions and Notations (mentioning)
confidence: 99%
“…To recover from a failure, the system restarts its execution from a previous consistent global state saved on the stable storage during fault-free execution. In distributed systems, checkpointing can be independent, coordinated [3], [8], [11] or quasi-synchronous [2], [9]. Message Logging is also used for fault tolerance in distributed systems [14].…”
Section: Introduction 11 Definitions and Notations (mentioning)
confidence: 99%