2020 IEEE Security and Privacy Workshops (SPW) 2020
DOI: 10.1109/spw50608.2020.00067
|View full text |Cite
|
Sign up to set email alerts
|

Research Report: ICARUS: Understanding De Facto Formats by Way of Feathers and Wax

Abstract: When a data format achieves a significant level of adoption, the presence of multiple format implementations expands the original specification in often-unforeseen ways. This results in an implicitly defined, de facto format, which can create vulnerabilities in programs handling the associated data files. In this paper we present our initial work on ICARUS: a toolchain for dealing with the problem of understanding and hardening de facto file formats. We show the results of our work in progress in the following… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 14 publications
0
6
0
Order By: Relevance
“…Given a grammar, we can generate a parser for that grammar, and then apply that parser to a sentence; if the parser fails, then the sentence can be labeled as anomalous. However, it can sometimes be more natural to learn a parser, rather than a grammar, as was done in (Cowger et al, 2020;Woods, 2021). As the resulting parser successfully produces a parse tree for any sentence, it cannot be used directly for anomaly detection.…”
Section: Discussion and Future Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Given a grammar, we can generate a parser for that grammar, and then apply that parser to a sentence; if the parser fails, then the sentence can be labeled as anomalous. However, it can sometimes be more natural to learn a parser, rather than a grammar, as was done in (Cowger et al, 2020;Woods, 2021). As the resulting parser successfully produces a parse tree for any sentence, it cannot be used directly for anomaly detection.…”
Section: Discussion and Future Workmentioning
confidence: 99%
“…For the specific example in Fig. 2 (left), the production rules happen to cover all sentences described by the grammar S -> '{' ('a' | 'b' | 'c' | S+) '}', referred to as the Simple-JSON grammar (Cowger et al, 2020;Woods, 2021); in particular, the '{G}' atom (appearing on the left side of the production rule '{G}' -> '{G' '}') approximately corresponds to S. However, while the rules will successfully parse any valid Simple-JSON sentence, will they also fail at parsing any sentence that is not in the Simple-JSON format, i.e., that is anomalous from the perspective of Simple-JSON? We will address this question in the following two subsections.…”
Section: Production Rule Extractionmentioning
confidence: 99%
See 2 more Smart Citations
“…1 were able to be learned automatically, solely from data. Applying RL to the grammar inference problem was initially presented at LangSec 2020 [6],…”
Section: Introductionmentioning
confidence: 99%