2022
DOI: 10.48550/arxiv.2211.13904
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Policy-Adaptive Estimator Selection for Off-Policy Evaluation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 0 publications
0
4
0
Order By: Relevance
“…We thus consider the MSE estimation task as an independent research topic and do not propose specific approaches to estimate the MSE from the logged data. Instead, our experiments will demonstrate that AIPS with our data-driven procedure for behavior model optimization performs reasonably well across a variety of experiment settings, even with a noisy MSE estimate and with an MSE estimated via an existing method from Udagawa et al [37] that uses only the observed logged data.…”
Section: Optimizing the User Behavior Modelmentioning
confidence: 88%
See 3 more Smart Citations
“…We thus consider the MSE estimation task as an independent research topic and do not propose specific approaches to estimate the MSE from the logged data. Instead, our experiments will demonstrate that AIPS with our data-driven procedure for behavior model optimization performs reasonably well across a variety of experiment settings, even with a noisy MSE estimate and with an MSE estimated via an existing method from Udagawa et al [37] that uses only the observed logged data.…”
Section: Optimizing the User Behavior Modelmentioning
confidence: 88%
“…Our work also raises several intriguing research questions for future studies. First, it would be valuable to develop an accurate way to estimate the MSE of an OPE estimator beyond existing methods [35,37] to better optimize the user behavior model to further improve AIPS. Second, OPE of ranking policies can still become extremely difficult when the number of unique actions (|A|) is large.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations