2022
DOI: 10.1016/j.arcontrol.2022.09.003
|View full text |Cite
|
Sign up to set email alerts
|

Probabilistic design of optimal sequential decision-making algorithms in learning and control

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 97 publications
0
3
0
Order By: Relevance
“…A key feature of our service is that it is built around a framework that intrinsically allows to consider stochastic behaviors, thus explicitly accounting for users’ stochasticity and to e.g., capture their privacy requirements. Specifically, CRAWLING exploits a recent data-driven control algorithm 6 , 7 that returns a randomized behavior for the car (i.e., a probability function) by solving a sequential decision-making problem 8 that formalizes the tracking of a desired behavior expressed in probabilistic terms. That is, our service systematically minimizes a cost function consisting of: (i) a reward capturing road/traffic conditions; (ii) a regularizer that biases the solution towards some target behavior, allowing CRAWLING to keep into account the possible preferences of passengers.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…A key feature of our service is that it is built around a framework that intrinsically allows to consider stochastic behaviors, thus explicitly accounting for users’ stochasticity and to e.g., capture their privacy requirements. Specifically, CRAWLING exploits a recent data-driven control algorithm 6 , 7 that returns a randomized behavior for the car (i.e., a probability function) by solving a sequential decision-making problem 8 that formalizes the tracking of a desired behavior expressed in probabilistic terms. That is, our service systematically minimizes a cost function consisting of: (i) a reward capturing road/traffic conditions; (ii) a regularizer that biases the solution towards some target behavior, allowing CRAWLING to keep into account the possible preferences of passengers.…”
Section: Introductionmentioning
confidence: 99%
“…In this context, decision engines 50 have been recently designed that enable agents to make decisions by merging streams of available data. The engines, which leverage the probabilistic framework 8 also used in this paper, have been also applied to parking management. Finally, data-driven intelligent transportation systems, and their data-driven analysis 51 have also gained traction, although heterogeneous datasets, emerging from different information streams that need to be merged, have been singled out as a challenge 52 , 53 .…”
Section: Introductionmentioning
confidence: 99%
“…divergence minimization and we refer to e.g., [15], [16] for examples across learning and control that involve minimizing this functional. Further, the study of mechanisms enabling agents to re-use data, also arises in the design of prediction algorithms from experts [17] and of learning algorithms from multiple simulators [18].…”
Section: Introductionmentioning
confidence: 99%