2018
DOI: 10.2139/ssrn.3189845
|View full text |Cite
|
Sign up to set email alerts
|

Robust Experimentation in the Continuous Time Bandit Problem

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 31 publications
0
5
0
Order By: Relevance
“…General nonconvexconcave problems can be solved by a simple gradient descent-ascent (GDA) scheme combined with quadratic regularization in y, with iteration complexity O(ε −5 ) [53]. 6 More elaborate algorithmic schemes based on the proximal-point method have O(ε −3 ) iteration complexity [54,95,55,57]. Here we propose a GDA-type scheme in the form of Algorithm 2.…”
Section: Algorithms Based On the Constant And Linear Approximationsmentioning
confidence: 99%
See 2 more Smart Citations
“…General nonconvexconcave problems can be solved by a simple gradient descent-ascent (GDA) scheme combined with quadratic regularization in y, with iteration complexity O(ε −5 ) [53]. 6 More elaborate algorithmic schemes based on the proximal-point method have O(ε −3 ) iteration complexity [54,95,55,57]. Here we propose a GDA-type scheme in the form of Algorithm 2.…”
Section: Algorithms Based On the Constant And Linear Approximationsmentioning
confidence: 99%
“…This allows to regularize with ρ = ρ 1 and results in ε-independent smoothness O( λ1 + µ 2 ρ 1 ) of the corresponding primal function. 6 This estimate follows from the complexity…”
Section: Algorithms Based On the Constant And Linear Approximationsmentioning
confidence: 99%
See 1 more Smart Citation
“…Now suppose the contract (s GP (I), s GP (R)) is offered to the GP. Similar to the binary case, the contract should satisfy s GP (I) ≥ p i s GP (R) (17) to make sure that GP does not invest in the bad project. Once offered, the agent chooses effort λ i which is the solution to the problem max…”
Section: And Cmentioning
confidence: 99%
“…( 1) minimizes the worse data distribution. Prior studies [2,3,4,15,42] considered different uncertainty sets (see Definition 3.1 in [15]) for which they proposed equivalent reformulations of Eq. (1) based on the specific choice of U m .…”
Section: Introductionmentioning
confidence: 99%