2014 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks 2014
DOI: 10.1109/dsn.2014.18
|View full text |Cite
|
Sign up to set email alerts
|

Failure Analysis of Virtual and Physical Machines: Patterns, Causes and Characteristics

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 83 publications
(26 citation statements)
references
References 10 publications
0
26
0
Order By: Relevance
“…The authors of [16] conduct a large scale analysis comparing and relating physical and virtual machine failures from commercial data centers. However, their study is limited due to an inconsistent clarity across different data sources they use.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The authors of [16] conduct a large scale analysis comparing and relating physical and virtual machine failures from commercial data centers. However, their study is limited due to an inconsistent clarity across different data sources they use.…”
Section: Related Workmentioning
confidence: 99%
“…Birke et al [16] suggest that virtual and physical machine TBFs have very similar distributions and the best fit is the gamma distribution with decreasing hazard rate, whereas the times to repair are well modelled with lognormal distribution. Instead, Viswanath et al [14] shows that time between successive failures on the same machine fits well an inverse function model.…”
Section: Related Workmentioning
confidence: 99%
“…Prior work on system reliability [11] points out that there exist positive correlations between application performance and system load. To demystify the time-varying behavior of failed jobs across priorities, we also resort to the system load.…”
Section: A Dependency On System Loadmentioning
confidence: 99%
“…Consequently, for each job, we compute the load indicators for three categories of priority: (i) its priority (ii) lower priority and (iii) higher priority. For example, when a job with priority 7 arrives at the system, we compute three different values of task arrival, considering tasks of priority 7, tasks with priority among [0, 6], and tasks with priority among [8,11], respectively. A similar computation needs to be applied for the throughput, as well as for the number of tasks.…”
Section: B Featuresmentioning
confidence: 99%
“…Although a large body of related work analyzes failures in large datacenters, most notably in terms of hardware [4], [11], software [7], [8], network components [5], and virtual machines [12], little work has been done [13], [14] in studying the broader class of unsuccessful executions in big-data systems. Nevertheless, deepening our knowledge in this field is of paramount importance, as unsuccessful executions can result in degradation of Quality of Service (QoS), reliability and energy waste that can ultimately lead to a high resource waste and performance impairment.…”
Section: Introductionmentioning
confidence: 99%