2021
DOI: 10.48550/arxiv.2106.03279
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Problems by Reinforcement Learning

Abstract: In the predict-then-optimize framework, the objective is to train a predictive model, mapping from environment features to parameters of an optimization problem, which maximizes decision quality when the optimization is subsequently solved. Recent work on decision-focused learning shows that embedding the optimization problem in the training pipeline can improve decision quality and help generalize better to unseen tasks compared to relying on an intermediate loss function for evaluating prediction quality. We… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 28 publications
0
1
0
Order By: Relevance
“…N. Elmachtoub & Grigas, 2021;Ferreira, Lee, & Simchi-Levi, 2016;Guo, Grushka-Cockayne, & De Reyck, 2021;J. Sun, Zhang, Hu, & Van Mieghem, 2021;K. Wang et al, 2021).…”
Section: Literature Reviewmentioning
confidence: 99%
“…N. Elmachtoub & Grigas, 2021;Ferreira, Lee, & Simchi-Levi, 2016;Guo, Grushka-Cockayne, & De Reyck, 2021;J. Sun, Zhang, Hu, & Van Mieghem, 2021;K. Wang et al, 2021).…”
Section: Literature Reviewmentioning
confidence: 99%