2018
DOI: 10.1016/j.compind.2018.03.027

Fault tolerance in cloud computing environment: A systematic survey

Cited by 82 publications (40 citation statements)
References 53 publications
“…The problem of fault tolerance is more challenging in cloud-based systems since the cloud computing architecture is highly complex and dynamically growing [30], [31]. According to a recent survey [32], among the traditional fault tolerance mechanisms (e.g., [6], [29], [33]), Checkpoint and Restart (CR) [6] is commonly used for implementing fault tolerance in the Cloud. The CR mechanism is normally utilized in order to restart applications in case of failures (e.g., [34], [35]), while it is also used for migrating tasks and applications from one node of the cloud to another (e.g., [36]) when special circumstances impose it (e.g., node unavailability).…”
Section: Prior Work | Citation type: mentioning
confidence: 99%
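
The Checkpoint and Restart idea described in the statement above can be illustrated with a small sketch. This is not taken from the surveyed or citing papers: the function names (`save_checkpoint`, `load_checkpoint`, `run`), the JSON state layout, and the `fail_at` parameter used to simulate a node failure are all assumptions made for illustration. The job saves its progress after each item, so a restarted run resumes from the last saved position instead of starting over.

```python
import json
import os
import tempfile


def save_checkpoint(path, state):
    """Write the state to a temp file, then rename it over the checkpoint.

    The rename (os.replace) is atomic, so a crash during the write
    cannot leave a half-written checkpoint behind.
    """
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as f:
        json.dump(state, f)
        f.flush()
        os.fsync(f.fileno())
    os.replace(tmp_path, path)


def load_checkpoint(path):
    """Return the saved state, or a fresh state if no checkpoint exists."""
    if not os.path.exists(path):
        return {"next_item": 0, "total": 0}
    with open(path) as f:
        return json.load(f)


def run(items, path, fail_at=None):
    """Sum the items, checkpointing after each one.

    fail_at is a hypothetical knob that raises an error at a given
    index to imitate a node failure.
    """
    state = load_checkpoint(path)
    for i in range(state["next_item"], len(items)):
        if fail_at is not None and i == fail_at:
            raise RuntimeError("simulated node failure")
        state["total"] += items[i]
        state["next_item"] = i + 1
        save_checkpoint(path, state)
    return state["total"]
```

Calling `run` again with the same checkpoint path after a failure continues from the saved `next_item`. Pointing a second machine at the same checkpoint file is, under these assumptions, the simplest form of the migration use case the statement mentions.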
“…In cloud, a lot of research work is done in the field of energy efficiency [3][4][5], fault tolerance [6], reputation [17], load balancing [7], and decision making systems for service selection [8][9], etc., but there is a lack of research towards the CGR selection problem.…”
Section: A Motivation and Contribution | Citation type: mentioning
confidence: 99%
“…Step 5: Determine the leaving and entering outranking flows. The leaving outranking flow is defined as: (5) The entering outranking flow is defined as: (6) Step 6: Calculate the outranking flow for each CSP (7)…”
Section: (4) | Citation type: mentioning
confidence: 99%
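
The statement above refers to Equations (5) to (7) without reproducing them. The terms "leaving" and "entering" outranking flow match the vocabulary of the PROMETHEE family of ranking methods, so the sketch below assumes the standard PROMETHEE II definitions. The citing paper's own equations may differ. The function name `outranking_flows` and the input matrix `pi` (pairwise preference indices between alternatives, here the CSPs) are hypothetical.

```python
def outranking_flows(pi):
    """Compute leaving, entering, and net flows from a preference matrix.

    pi[a][b] is the aggregated preference of alternative a over b,
    assumed to lie in [0, 1]; diagonal entries are ignored.
    Assumes the standard PROMETHEE II definitions:
      leaving(a)  = sum over b != a of pi[a][b] / (n - 1)
      entering(a) = sum over b != a of pi[b][a] / (n - 1)
      net(a)      = leaving(a) - entering(a)
    """
    n = len(pi)
    if n < 2:
        raise ValueError("need at least two alternatives")
    leaving = [
        sum(pi[a][b] for b in range(n) if b != a) / (n - 1) for a in range(n)
    ]
    entering = [
        sum(pi[b][a] for b in range(n) if b != a) / (n - 1) for a in range(n)
    ]
    net = [out - inc for out, inc in zip(leaving, entering)]
    return leaving, entering, net
```

Under these assumptions, alternatives are ranked by descending net flow, and the net flows always sum to zero across alternatives.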
“…The tendency of software to fail or cause a system failure after running continuously for a specific time period is referred to as software aging [6,7]. Software aging is a phenomenon in long-run software systems that causes an increased failure rate and/or degraded performance due to accumulation of aging errors [8,9].…”
Section: Introduction | Citation type: mentioning
confidence: 99%
“…Since VM requests in IaaS clouds are usually different in terms of software and tools for which they are initiated, their corresponding VMs exhibit complex behaviors and sophisticated interactions throughout their lifetime that enable VMMs to manage a wide variety of VM behaviors. Consequently, after running for a long time or managing heavy workloads, VMMs, like any other software, age and slow due to a multitude of internal errors and diverse behaviors of VMs [7], [10], [11], [12], [13].…”
Section: Introduction | Citation type: mentioning
confidence: 99%