2019
DOI: 10.14778/3352063.3352105

Grano

Abstract: We demonstrate Grano, an end-to-end anomaly detection and root cause analysis (RCA) system for cloud-native distributed data platforms, providing a holistic view of the system component topology, alarms, and application events. Grano provides: a Detection Layer that processes large amounts of time-series monitoring data to detect anomalies at logical and physical system components; an Anomaly Graph Layer with novel graph modelin…

Cited by 27 publications (3 citation statements)
References 3 publications
“…With the recent upsurge of work in the field of performance diagnosis of large and complex modern microservice architectures [10,11,17,27,29], researchers at many cloud-based companies are actively working on alerting and monitoring solutions [12,21,44]. Alerting services like Watchdog, New Relic and Splunk diagnose systems by constructing causal graphs between services or using thresholding techniques.…”
Section: Related Work (mentioning)
confidence: 99%
“…Service dependency graphs are also used to answer "what-if" based system questions on bandwidth management and application latencies [16,18,41,42]. Grano [44] builds a causal dependency graph among physical resources for fault diagnosis. However, performance diagnosis is often demanded at a more granular level.…”
Section: Related Work (mentioning)
confidence: 99%
“…With the growth of the DCN scale and the emerging application of new network devices, failures in DCNs have become the norm rather than occasional events. Failures in DCNs usually have more severe consequences than in general networks, from disruption of services to loss of critical data [5]. Meanwhile, the complexity of large-scale DCN management makes it difficult for network administrators to find and locate failures in a timely manner.…”
Section: Introduction (mentioning)
confidence: 99%