2011
DOI: 10.1002/cpe.1879
|View full text |Cite
|
Sign up to set email alerts
|

A checkpointing‐enabled and resource‐aware Java Virtual Machine for efficient and robust e‐Science applications in grid environments

Abstract: Object-oriented programming languages presently are the dominant paradigm of application development (e. g., Java,. NET). Lately, increasingly more Java applications have long (or very long) execution times and manipulate large amounts of data/information, gaining relevance in fields related with e-Science (with Grid and Cloud computing). Significant examples include Chemistry, Computational Biology and Bio-informatics, with many available Java-based APIs (e. g., Neobio).Often, when the execution of such an ap… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
3
2
2

Relationship

2
5

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 26 publications
0
4
0
Order By: Relevance
“…Among them, the implementation of a fault tolerant Java Virtual Machine that supports checkpointing is proposed in [14], and the robustness of the critical paths of workflows is ensured in [15]. In both cases, an issue of the work is to find a balance between resource consumption and desired reliability.…”
Section: Experimental Middlewarementioning
confidence: 99%
“…Among them, the implementation of a fault tolerant Java Virtual Machine that supports checkpointing is proposed in [14], and the robustness of the critical paths of workflows is ensured in [15]. In both cases, an issue of the work is to find a balance between resource consumption and desired reliability.…”
Section: Experimental Middlewarementioning
confidence: 99%
“…In order to comply with this requisite, each instance of ARA-JVM is enhanced with services that allow for: i) monitor the application progress, ii) account resource consumption, iii) reconfigure internal parameters and/or mechanisms, and iv) checkpoint, restore and migrate the whole application. In [12] we focus on the last point which regards checkpointing and restore. Our current design and implementation effort mainly concerns the progress monitoring module and the resource scheduler.…”
Section: An Enabling Architecturementioning
confidence: 99%
“…Furthermore, the replay can only be necessary to be done from a certain point in time because the fault is known to occur only at the end of execution. Ditto uses a lightweight checkpointing mechanism [17] to offer two new replay services: (i) replay to most recent point before fault; (ii) replay to any instant M in execution. Checkpoint is done recording each thread stack and reachable objects.…”
Section: Lightweight Checkpointingmentioning
confidence: 99%
“…In this case, the total recording space is N * sizeof (checkpoint) + N * sizeof (truncatedLog), where N is the number of times a checkpoint is done. In this case there is a trade-off between overhead in execution time and granularity in available replay start times [17]. Even so, the total recording space is bounded to be smaller than 2 * N * sizeof (checkpoint).…”
Section: Lightweight Checkpointingmentioning
confidence: 99%