2008
DOI: 10.1109/icpp-w.2008.30
|View full text |Cite
|
Sign up to set email alerts
|

Disparity: Scalable Anomaly Detection for Clusters

Abstract: In this paper, we describe disparity, a tool that does parallel, scalable anomaly detection for clusters. Disparity uses basic statistical methods and scalable reduction operations to perform data reduction on client nodes and uses these results to locate node anomalies. We discuss the implementation of disparity and present results of its use on a SiCortex SC5832 system.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 6 publications
0
2
0
Order By: Relevance
“…The second part of PHM is a core infrastructure that launches the diagnostics, aggregates the results, generates/updates snapshots, and will ultimately provide control to determine what diagnostic to launch next. Part of this infrastructure (the diagnostic launching, snapshot manipulation, and control logic) will run on a head node, and part of this infrastructure (data aggregation) is distributed across the system and uses the system itself to efficiently compute statistics, a la Disparity [7]. An important goal is to make it as easy as possible for users of PHM to develop their own diagnostics without needing to reinvent such an infrastructure.…”
Section: Approachmentioning
confidence: 99%
See 1 more Smart Citation
“…The second part of PHM is a core infrastructure that launches the diagnostics, aggregates the results, generates/updates snapshots, and will ultimately provide control to determine what diagnostic to launch next. Part of this infrastructure (the diagnostic launching, snapshot manipulation, and control logic) will run on a head node, and part of this infrastructure (data aggregation) is distributed across the system and uses the system itself to efficiently compute statistics, a la Disparity [7]. An important goal is to make it as easy as possible for users of PHM to develop their own diagnostics without needing to reinvent such an infrastructure.…”
Section: Approachmentioning
confidence: 99%
“…However, PHM is the first to utilize a global view of system performance rather than an aggregation of local views. To elaborate, existing tools for monitoring system performance, including Disparity [7], NWPerf [15], Supermon [24], ClusterProbe [12], and Performance Co-Pilot [22], are all designed for continuous monitoring of system performance. Consequently, measurement overhead is a serious concern.…”
Section: Node-local Performance Monitoringmentioning
confidence: 99%