2020
DOI: 10.1016/j.jss.2019.110464
|View full text |Cite
|
Sign up to set email alerts
|

Predicting failures in multi-tier distributed systems

Abstract: Many applications are implemented as multi-tier software systems, and are executed on distributed infrastructures, like cloud infrastructures, to benefit from the cost reduction that derives from dynamically allocating resources ondemand. In these systems, failures are becoming the norm rather than the exception, and predicting their occurrence, as well as locating the responsible faults, are essential enablers of preventive and corrective actions that can mitigate the impact of failures, and significantly imp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
32
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 35 publications
(32 citation statements)
references
References 41 publications
0
32
0
Order By: Relevance
“…Compared to studies where failures are used in the training phase [10], [13], HTM achieved slightly lower effectiveness. Although results obtained in different experiments are not directly comparable, a gap was expected.…”
Section: Rq1: Prediction Effectivenessmentioning
confidence: 70%
See 3 more Smart Citations
“…Compared to studies where failures are used in the training phase [10], [13], HTM achieved slightly lower effectiveness. Although results obtained in different experiments are not directly comparable, a gap was expected.…”
Section: Rq1: Prediction Effectivenessmentioning
confidence: 70%
“…Note that failure-free executions are not free of anomalies, since services may operate in an unpredictable way, despite not generating any failure. To generate the traffic for the running system, we generated a number of users and calls according to weekly and daily patterns (e.g., more users on workdays, fewer users at night, pick time at 9am and 7pm), including a certain degree of randomness, as done in similar studies that used Clearwater as subject system [10].…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…This is the improved version of their offline technique already introduced in Section 5.3.1. Lastly, Mariani et al [80] aim at predicting failures in distributed multi-tier environments by employing a two-stage approach, consisting of anomaly detection in the first stage, and a signature-based classification technique in the second stage.…”
Section: Architectural Modelsmentioning
confidence: 99%