2021
DOI: 10.48550/arxiv.2103.15938
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Safe Model-based Control from Signal Temporal Logic Specifications Using Recurrent Neural Networks

Abstract: We propose a policy search approach to learn controllers from specifications given as Signal Temporal Logic (STL) formulae. The system model is unknown, and it is learned together with the control policy. The model is implemented as a feedforward neural network (FNN). To capture the history dependency of the STL specification, we use a recurrent neural network (RNN) to implement the control policy. In contrast to prevalent model-free methods, the learning approach proposed here takes advantage of the learned m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(16 citation statements)
references
References 27 publications
0
16
0
Order By: Relevance
“…In Problem 1, we look for a set of the parameters of the control policy W that maximizes the sum of the expectation of the robustness score with respect to the distribution of initial states over all M candidate STL specifications. Since the control policy depends on the past and present states, or it is history dependent, in this paper we use a recurrent neural network (RNN) to learn the control policy π (see, [14], [15]). RNN is a type of a neural network that has a feedback architecture.…”
Section: Problem Statementmentioning
confidence: 99%
See 3 more Smart Citations
“…In Problem 1, we look for a set of the parameters of the control policy W that maximizes the sum of the expectation of the robustness score with respect to the distribution of initial states over all M candidate STL specifications. Since the control policy depends on the past and present states, or it is history dependent, in this paper we use a recurrent neural network (RNN) to learn the control policy π (see, [14], [15]). RNN is a type of a neural network that has a feedback architecture.…”
Section: Problem Statementmentioning
confidence: 99%
“…Remark 3: When we work with a long temporal sequence of states, numerical instability can be problematic with the above scheme since when computing the gradient of the cost function, the derivative computed at the previous time step should be multiplied with a Jacobian matrix of the system model. To address this problem, [11], [15] employed the weights update scheme with the Lagrange Multiplier method. Although we do not consider these techniques in this paper since such a problem was not observed in the example discussed in Section VI, it may be desirable to use such techniques for applications with a long time horizon.…”
Section: B Training Rnnmentioning
confidence: 99%
See 2 more Smart Citations
“…We analyzed over 130 STL specifications and their associated English-language formulation, from scientific papers and industrial documents. The investigated literature covers multiple application domains: specification patterns [42], automatic driving [43][44][45], robotics [46][47][48][49][50], time-series analysis [51] and electronics [2,52]. Although this literature contains data that is not statistically exhaustive, it still provides valuable information to guide the design of the data generator and address the research question RQ1.…”
Section: Empirical Stl Statisticsmentioning
confidence: 99%