2019
DOI: 10.1007/978-3-030-30942-8_38
|View full text |Cite
|
Sign up to set email alerts
|

$$L^*$$-Based Learning of Markov Decision Processes

Abstract: Automata learning techniques automatically generate system models from test observations. These techniques usually fall into two categories: passive and active. Passive learning uses a predetermined data set, e.g., system logs. In contrast, active learning actively queries the system under learning, which is considered more efficient. An influential active learning technique is Angluin's L * algorithm for regular languages which inspired several generalisations from DFAs to other automata-based modelling forma… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
1
1

Relationship

3
4

Authors

Journals

citations
Cited by 25 publications
(22 citation statements)
references
References 45 publications
0
22
0
Order By: Relevance
“…To the best of our knowledge, L * mdp is the first L*-based algorithm for MDPs that can be implemented via testing. Experimental results and the implementation can be found in the evaluation material for L * mdp [Tap20].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…To the best of our knowledge, L * mdp is the first L*-based algorithm for MDPs that can be implemented via testing. Experimental results and the implementation can be found in the evaluation material for L * mdp [Tap20].…”
Section: Discussionmentioning
confidence: 99%
“…All tables include mean values along with the corresponding standard deviations separated by ±. Experimental results, the examined models, and the implementation of L * mdp can be found in the evaluation material [Tap20].…”
Section: Methodsmentioning
confidence: 99%
“…One easily adapts the example from Section 5 to show that learning probabilistic automata has a similar termination issue. On the positive side, Tappler et al [26] have shown that deterministic MDPs can be learned using an L based algorithm. The deterministic MDPs in loc.cit.…”
Section: Discussionmentioning
confidence: 99%
“…Predictive monitoring has been combined with deep learning [17] and Bayesian inference [22], where the key problem is that the computation of an imminent failure is too expensive to be done exactly. More generally, learning automata models has been motivated with runtime assurance [1,55]. Testing approaches statistically evaluate whether traces are likely to be produced by a given model [25].…”
Section: Related Workmentioning
confidence: 99%