2010
DOI: 10.1007/978-3-642-13119-6_23

Checkpointing and Migration of Communication Channels in Heterogeneous Grid Environments

Abstract: A grid checkpointing service providing migration and transparent fault tolerance is important for distributed and parallel applications executed in heterogeneous grids. In this paper we address the challenges of checkpointing and migrating communication channels of grid applications executed on nodes equipped with different checkpointer packages. We present a solution that is transparent for the applications and the underlying checkpointers. It also allows using single node checkpointers for distribu…

Cited by 3 publications (3 citation statements)
References 9 publications
“…In [10] we have evaluated the coordinated checkpointing protocol using a similar client-server application within a heterogeneous environment consisting of single PC-nodes and an SSI cluster. Therefore, we have measured the time it takes to checkpoint this application when it opens up 50 communications channels and sends 100 Byte packets periodically every five seconds.…”
Section: F Performance Of Coordinated and Independent Checkpointing (mentioning)
confidence: 99%
“…This service is designed to support different checkpointing protocols and address the underlying grid-node checkpointers in a transparent manner through a uniform interface. Despite the generic design of this service, it currently only implements the coordinated checkpointing protocol [10]. Given a grid environment where data and computing resources of one application can be distributed across thousands of grid nodes, coordinated checkpointing can be costly in terms of scalability [10].…”
Section: Introduction (mentioning)
confidence: 99%