Proceedings of the 26th Conference on Program Comprehension 2018
DOI: 10.1145/3196321.3196340
|View full text |Cite
|
Sign up to set email alerts
|

A search-based approach for accurate identification of log message formats

Abstract: Many software engineering activities process the events contained in log files. However, before performing any processing activity, it is necessary to parse the entries in a log file, to retrieve the actual events recorded in the log. Each event is denoted by a log message, which is composed of a fixed part-called (event) template-that is the same for all occurrences of the same event type, and a variable part, which may vary with each event occurrence. The formats of log messages, in complex and evolving syst… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
53
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 120 publications
(53 citation statements)
references
References 40 publications
0
53
0
Order By: Relevance
“…For example, if a log sequence [E1, E2, E2] is parsed to [E1, E4, E5], we get PA=1/3, since the 2nd and 3rd messages are not grouped together. In contrast to standard evaluation metrics that are used in previous studies, such as precision, recall, and F1-measure [9], [22], [28], PA is a more rigorous metric. In PA, partially matched events are considered incorrect.…”
Section: A Experimental Setupmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, if a log sequence [E1, E2, E2] is parsed to [E1, E4, E5], we get PA=1/3, since the 2nd and 3rd messages are not grouped together. In contrast to standard evaluation metrics that are used in previous studies, such as precision, recall, and F1-measure [9], [22], [28], PA is a more rigorous metric. In PA, partially matched events are considered incorrect.…”
Section: A Experimental Setupmentioning
confidence: 99%
“…For example, Spell [24] utilizes the longest common subsequence algorithm to parse logs in a stream manner. Recently, Messaoudi et al [28] propose MoLFI, which models log parsing as a multipleobjective optimization problem and solves it using evolutionary algorithms.…”
Section: Techniques Of Log Parsersmentioning
confidence: 99%
“…Although beneficial to the effectiveness of the log parsing, this is a manual process. Messaoudi et al [18] capture the template of a message by applying an evolutionary algorithm. This first of a kind approach significantly outperforming other approaches ( [14], [17]).…”
Section: Related Workmentioning
confidence: 99%
“…As there is no direct connection between log messages and source code in the produced (raw) log data, the link must be created afterwards. Previous works propose several approaches based on clustering [10], [23], heuristics [17], [24], longest common sequence method [9], textual similarities [14], evolutionary search [18], and static analysis [28] to solve this challenge.…”
Section: Introductionmentioning
confidence: 99%
“…In this work we assume that we have already prepared such set of log templates for a given application using existing techniques. Accurately discovering the log templates from the given set of log data is an active field of research and there are several techniques available for us to use [20][21][22][23]. It is not the goal of this work to design new log template discovery techniques.…”
Section: Log Templatesmentioning
confidence: 99%