2016
DOI: 10.1007/978-3-319-45279-1_7

Certified Derivative-Based Parsing of Regular Expressions

Abstract: Parsing is pervasive in computing and fundamental in several software artifacts. This dissertation reports the first step in our ultimate goal: a formally verified toolset for parsing regular and context-free languages based on derivatives. Specifically, we describe the formalization of Brzozowski and Antimirov derivative-based algorithms for regular expression parsing in the dependently typed language Agda. The formalization produces a proof that either an input string matches a given regular expression or t…
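For readers unfamiliar with the technique named in the abstract, the following is a minimal, unverified sketch of Brzozowski-derivative matching in Python. It is not the authors' Agda formalization: it returns a plain Boolean instead of a proof object, it does not cover Antimirov partial derivatives, and every name in it (`Regex`, `nullable`, `deriv`, `matches`) is a hypothetical choice made for illustration.

```python
from dataclasses import dataclass


class Regex:
    """Base class for regular expression syntax trees (illustrative only)."""


@dataclass(frozen=True)
class Empty(Regex):      # matches no string at all
    pass


@dataclass(frozen=True)
class Eps(Regex):        # matches only the empty string
    pass


@dataclass(frozen=True)
class Chr(Regex):        # matches the single character c
    c: str


@dataclass(frozen=True)
class Alt(Regex):        # union: left | right
    left: Regex
    right: Regex


@dataclass(frozen=True)
class Cat(Regex):        # concatenation: left followed by right
    left: Regex
    right: Regex


@dataclass(frozen=True)
class Star(Regex):       # Kleene star: zero or more repetitions
    inner: Regex


def nullable(e: Regex) -> bool:
    """Return True when e accepts the empty string."""
    if isinstance(e, (Eps, Star)):
        return True
    if isinstance(e, Alt):
        return nullable(e.left) or nullable(e.right)
    if isinstance(e, Cat):
        return nullable(e.left) and nullable(e.right)
    return False  # Empty and Chr never accept the empty string


def deriv(e: Regex, a: str) -> Regex:
    """Brzozowski derivative of e with respect to the character a."""
    if isinstance(e, Chr):
        return Eps() if e.c == a else Empty()
    if isinstance(e, Alt):
        return Alt(deriv(e.left, a), deriv(e.right, a))
    if isinstance(e, Cat):
        head = Cat(deriv(e.left, a), e.right)
        # If the left part can be empty, a may also start the right part.
        return Alt(head, deriv(e.right, a)) if nullable(e.left) else head
    if isinstance(e, Star):
        return Cat(deriv(e.inner, a), e)
    return Empty()  # Empty and Eps have no derivative other than Empty


def matches(e: Regex, s: str) -> bool:
    """Derive e by each character of s, then test for the empty string."""
    for a in s:
        e = deriv(e, a)
    return nullable(e)


# Hypothetical example expression: (a|b)*c
example = Cat(Star(Alt(Chr("a"), Chr("b"))), Chr("c"))
```

Calling `matches(example, "abac")` derives the expression once per input character and then checks nullability. The sketch performs no simplification of intermediate expressions, so derivatives can grow with the input length; practical implementations usually normalize them.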

Citations: cited by 7 publications (5 citation statements)
References: 24 publications
“…Hardin [11] uses the HOL4 theorem prover to verify an implementation of Brzozowski's regex-to-DFA algorithm, which is incorporated into a lexer. Lopes et al [16] present a regex matcher based on Brzozowski derivatives. The matcher takes a regular expression e and a string s as input; if s matches e, the tool produces a proof of the match.…”
Section: Related Work (mentioning)
confidence: 99%
“…al. [28] describe an Idris formalization of a RE parsing tool using Brzozowski's derivatives. Like our work, they proved both soundness and completeness lemmas for the derivative operation and used data-types for denoting prefixes and substrings proofs for a given input RE and string.…”
Section: Related Work (mentioning)
confidence: 99%
“…al. [28] also mention that they use natural numbers in Peano notation to represent alphabet symbols as their respective ASCII codes, instead of using Idris type for characters. According to the authors, the reason for this design choice is due to the way that Idris deals with propositional equality for primitive types, like Char.…”
Section: Related Work (mentioning)
confidence: 99%
“…In subsequent work, Tan and Morrisett [24] certified encoder/decoder pairs are constructed from bidirectional grammars. Lopes, Ribeiro, and Camarão [19] have also verified a regular expression parser using Idris [5]. Bernardy and Jansson [3] have formalized Valiant's algorithm for parsing context-free languages in Agda [4].…”
Section: Introduction (mentioning)
confidence: 99%