2012
DOI: 10.1145/2347736.2347751
|View full text |Cite
|
Sign up to set email alerts
|

Fault injection in production

Abstract: Making the case for resilience testing.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 0 publications
0
6
0
Order By: Relevance
“…They call this practice "chaos engineering". When fault injection is done in production on a special day under full control (as opposed to automatically at any arbitrary point in time), it is called a GameDay exercise [1].…”
Section: Failure Injection In Productionmentioning
confidence: 99%
“…They call this practice "chaos engineering". When fault injection is done in production on a special day under full control (as opposed to automatically at any arbitrary point in time), it is called a GameDay exercise [1].…”
Section: Failure Injection In Productionmentioning
confidence: 99%
“…There has been a paradigm shift --from trying to avoid failures at all costs to embracing faults as opportunities for making the system more resilient. The rationale behind fault injection testing of deployed software can be summarized as follows [4]:…”
Section: Relationship To Cloud-based Solutionsmentioning
confidence: 99%
“…Their focus mainly lies on hardware fault models, namely network partitions and latency, as well as node crashes. There are also "fault model agnostic", configurable solutions such as [7] [8] [4]. These solutions provide a framework for easily injecting faults, but the fault classes themselves have to be implemented to some extent by the user.…”
mentioning
confidence: 99%
“…Fault injection is a significant solution to emulate these problems in a controlled way, to make distributed systems more fault-tolerant. For example, several large companies, such as Netflix, Uber, Amazon, have been using fault injection for their chaos engineering and game day exercises to assess the reliability of their services [13,14]. Unfortunately, fault injection has a high entry barrier, and it is still beyond the reach of the minor service providers, due to the cost and complexity of planning and orchestrate fault injection experiments.…”
Section: Introductionmentioning
confidence: 99%