Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles 2009
DOI: 10.1145/1629575.1629587
|View full text |Cite
|
Sign up to set email alerts
|

Detecting large-scale system problems by mining console logs

Abstract: Surprisingly, console logs rarely help operators detect problems in large-scale datacenter services, for they often consist of the voluminous intermixing of messages from many software components written by independent developers. We propose a general methodology to mine this rich source of information to automatically detect system runtime problems. We first parse console logs by combining source code analysis with information retrieval to create composite features. We then analyze these features using machin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
618
0
4

Year Published

2014
2014
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 938 publications
(624 citation statements)
references
References 23 publications
2
618
0
4
Order By: Relevance
“…Barham et al [5] proposed to use clustering to identify anomalous requests. Xu et al [38] proposed to find erroneous execution paths using the PCA [19] on console logs. Bod et al [6] used logistic regression with L1 regularization on the vector of metric quantiles to fingerprint performance crisis.…”
Section: Related Workmentioning
confidence: 99%
“…Barham et al [5] proposed to use clustering to identify anomalous requests. Xu et al [38] proposed to find erroneous execution paths using the PCA [19] on console logs. Bod et al [6] used logistic regression with L1 regularization on the vector of metric quantiles to fingerprint performance crisis.…”
Section: Related Workmentioning
confidence: 99%
“…Moreover, modern systems consisting of multiple components developed by different vendors and the frequent upgrades of those components make it difficult for a single expert to have complete knowledge of the total system and to set the rules effectively. This complexity has given rise to statistical learning based log analytic tools such as the works of Lou et al (Lou et al, 2010) and Xu et al (Xu et al, 2009), which extract features from console logs and then use statistical techniques to automatically build models for system anomaly identification. Lou et al (Lou et al, 2010) propose a statistical learning technique which consists of a learning process and a detection process.…”
Section: Console Log Based Anomaly Detectionmentioning
confidence: 99%
“…Each mismatch in the invariants is considered to be anomalous. Xu et al (Xu et al, 2009) propose a new methodology to mine console logs to automatically detect system problems. This first creates feature vectors from the logs and then applies the PCA (Principal Component Analysis) algorithm on the feature vectors to detect anomalies.…”
Section: Console Log Based Anomaly Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Different analysis tasks pay attention to different application aspects, such as system failure tracing [19,10], event correlation discovery [20,18,16], and event based trend analysis [5,6,7]. In practice, these methods are often conducted when the analysts already have some prior knowledge about the data.…”
Section: Related Workmentioning
confidence: 99%