2021 IEEE/ACM 29th International Symposium on Quality of Service (IWQOS) 2021
DOI: 10.1109/iwqos52092.2021.9521340
|View full text |Cite
|
Sign up to set email alerts
|

Practical Root Cause Localization for Microservice Systems via Trace Analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 73 publications
(7 citation statements)
references
References 32 publications
0
7
0
Order By: Relevance
“…In [5] , An anomaly detection method ADS for container-based microservice performance metrics is proposed, which considers a set of key performance indicators for each containerized service and then uses a machine learning algorithm to train a classifier for detecting the indicator conditions. [6] Use a range of system status data such as per-call latency, HTTP status, etc. as metrics for tracing anomaly detection, and adaptively select useful metrics for each anomaly to improve monitoring accuracy.…”
Section: Related Workmentioning
confidence: 99%
“…In [5] , An anomaly detection method ADS for container-based microservice performance metrics is proposed, which considers a set of key performance indicators for each containerized service and then uses a machine learning algorithm to train a classifier for detecting the indicator conditions. [6] Use a range of system status data such as per-call latency, HTTP status, etc. as metrics for tracing anomaly detection, and adaptively select useful metrics for each anomaly to improve monitoring accuracy.…”
Section: Related Workmentioning
confidence: 99%
“…N, n the number of traces, the number of services m, c the type number of metrics, the collected number of each type metric Metric Anomaly Score: We use the mean ” ik and standard deviation σ ik of the service metrics to calculate service anomaly severity [20][21][22][23][24] . The ” ik is the expected normal value and the σ ik indicates that the metric deviates from the mean.…”
Section: Notation Definitionsmentioning
confidence: 99%
“…Moreover, the current service is influenced by other services if it contains incoming and outcoming abnormal invocations 15,20 . In order to calculate the additional teleportation vector U more accurately in the PageRank algorithm, we calculate the suspicious score by the absolute value of the difference between the numbers of traces that contain incoming abnormal invocations Of iin and outcoming abnormal invocations Of iout .…”
Section: Abnormal Services Rankingmentioning
confidence: 99%
See 1 more Smart Citation
“…Prior studies on root cause analysis [25][26][27]30] have mainly focused on a simplified scenario, where the target system is modeled as a single isolated causal graph, and the system's malfunctioning effects can only propagate within the same network of entities. For instance, to identify various types of service root causes, Liu et al [27] generated a service call graph based on domain-specific software and rules.…”
Section: Introductionmentioning
confidence: 99%