2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS) 2017
DOI: 10.1109/focs.2017.55
|View full text |Cite
|
Sign up to set email alerts
|

Oracle-Efficient Online Learning and Auction Design

Abstract: We consider the design of computationally efficient online learning algorithms in an adversarial setting in which the learner has access to an offline optimization oracle. We present an algorithm called Generalized Follow-the-Perturbed-Leader and provide conditions under which it is oracle-efficient while achieving vanishing regret. Our algorithm generalizes the Follow-the-Perturbed-Leader (FTPL) approach of Kalai and Vempala [32] in which the action with the highest randomly perturbed historical performance i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
43
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 27 publications
(43 citation statements)
references
References 37 publications
0
43
0
Order By: Relevance
“…But in the MMR problem, the revenue is not a linear function of the reserves or of the valuations. 13 After the conference version of this article, Roughgarden and Wang (2016) and Dudik et al (2017) studied such "onlineto-offline reductions" in more general settings. One of their results gives conditions on an approximation algorithm under which it can be translated into an equally good online learning algorithm, and it includes our Theorem 4.1 as a special case.…”
Section: Our Resultsmentioning
confidence: 99%
“…But in the MMR problem, the revenue is not a linear function of the reserves or of the valuations. 13 After the conference version of this article, Roughgarden and Wang (2016) and Dudik et al (2017) studied such "onlineto-offline reductions" in more general settings. One of their results gives conditions on an approximation algorithm under which it can be translated into an equally good online learning algorithm, and it includes our Theorem 4.1 as a special case.…”
Section: Our Resultsmentioning
confidence: 99%
“…original game.Moreover, as we show in Theorem 1 there is a polynomial time algorithm for finding the Stackelberg equilibrium of such games when the leader can optimize a linear function over the actions of the follower, which is a similar condition to the ones used for computing Stackelberg equilibria in large zero-sum games (Ahmadinejad et al, 2016;Dudík et al, 2017;Kalai and Vempala, 2005).…”
Section: Our Results and Contributionsmentioning
confidence: 89%
“…We first note that when c e and C e are set to 0 for all e ∈ E, this game is zero-sum and can be efficiently solved when each player can compute its best-response to any choice of mixed strategy of the other player, i.e., optimize a linear function over the strategy space of the other player using existing results (Ahmadinejad et al, 2016;Dudík et al, 2017;Kalai and Vempala, 2005).…”
Section: Our Results and Contributionsmentioning
confidence: 99%
“…All our algorithms perform a single oracle call per round. a In the extended abstract [20], we presented the regret bound O (nm 2 √ T ) and running time O (T 2 + nmT ), corresponding to discretized level auctions with distinct thresholds (see Theorem 3.13). Here, we present the result that allows repetitions of threshold values (see Theorem 3.15).…”
Section: Main Application: Online Auction Designmentioning
confidence: 99%
“…b The regime of interest in this problem is s n. The extended abstract [20] contained a worse bound O (n 6 √ T ).…”
Section: Main Application: Online Auction Designmentioning
confidence: 99%