Oracle-Efficient Online Learning and Auction Design

Dudík, Miroslav; Haghtalab, Nika; Luo, Haipeng; Schapire, Robert E.; Syrgkanis, Vasilis; Vaughan, J.

doi:10.1109/focs.2017.55

Cited by 27 publications

(43 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…But in the MMR problem, the revenue is not a linear function of the reserves or of the valuations. 13 After the conference version of this article, Roughgarden and Wang (2016) and Dudik et al (2017) studied such "onlineto-offline reductions" in more general settings. One of their results gives conditions on an approximation algorithm under which it can be translated into an equally good online learning algorithm, and it includes our Theorem 4.1 as a special case.…”

Section: Our Resultsmentioning

confidence: 99%

Minimizing Regret with Multiple Reserves

Roughgarden

Wang

2019

ACM Trans. Econ. Comput.

View full text Add to dashboard Cite

We study the problem of computing and learning non-anonymous reserve prices to maximize revenue. We first define the M aximizing M ultiple R eserves (MMR) problem in single-parameter matroid environments, where the input is m valuation profiles v 1 ,…, v m , indexed by the same n bidders, and the goal is to compute the vector r of (non-anonymous) reserve prices that maximizes the total revenue obtained on these profiles by the VCG mechanism with reserves r . We prove that the problem is APX -hard, even in the special case of single-item environments, and give a polynomial-time 1/2-approximation algorithm for it in arbitrary matroid environments. We then consider the online no-regret learning problem and show how to exploit the special structure of the MMR problem to translate our offline approximation algorithm into an online learning algorithm that achieves asymptotically time-averaged revenue at least 1/2 times that of the best fixed reserve prices in hindsight. On the negative side, we show that, quite generally, computational hardness for the offline optimization problem translates to computational hardness for obtaining vanishing time-averaged regret. Thus, our hardness result for the MMR problem implies that computationally efficient online learning requires approximation, even in the special case of single-item auction environments.

show abstract

Section: Our Resultsmentioning

confidence: 99%

Minimizing Regret with Multiple Reserves

Roughgarden

Wang

2019

ACM Trans. Econ. Comput.

View full text Add to dashboard Cite

show abstract

“…original game.Moreover, as we show in Theorem 1 there is a polynomial time algorithm for finding the Stackelberg equilibrium of such games when the leader can optimize a linear function over the actions of the follower, which is a similar condition to the ones used for computing Stackelberg equilibria in large zero-sum games (Ahmadinejad et al, 2016;Dudík et al, 2017;Kalai and Vempala, 2005).…”

Section: Our Results and Contributionsmentioning

confidence: 89%

“…We first note that when c e and C e are set to 0 for all e ∈ E, this game is zero-sum and can be efficiently solved when each player can compute its best-response to any choice of mixed strategy of the other player, i.e., optimize a linear function over the strategy space of the other player using existing results (Ahmadinejad et al, 2016;Dudík et al, 2017;Kalai and Vempala, 2005).…”

Section: Our Results and Contributionsmentioning

confidence: 99%

Computing Stackelberg Equilibria of Large General-Sum Games

Blum

Haghtalab

Hajiaghayi

et al. 2019

Algorithmic Game Theory

Self Cite

View full text Add to dashboard Cite

We study the computational complexity of finding Stackelberg Equilibria in general-sum games, where the set of pure strategies of the leader and the followers are exponentially large in a natrual representation of the problem.In zero-sum games, the notion of a Stackelberg equilibrium coincides with the notion of a Nash Equilibrium Korzhyk et al. (2011b). Finding these equilibrium concepts in zero-sum games can be efficiently done when the players have polynomially many pure strategies or when (in additional to some structural properties) a best-response oracle is available Ahmadinejad et al. (2016); Dudík et al. (2017); Kalai and Vempala (2005). Despite such advancements in the case of zero-sum games, little is known for general-sum games.In light of the above, we examine the computational complexity of computing a Stackelberg equilibrium in large general-sum games. We show that while there are natural large general-sum games where the Stackelberg Equilibria can be computed efficiently if the Nash equilibrium in its zero-sum form could be computed efficiently, in general, structural properties that allow for efficient computation of Nash equilibrium in zero-sum games are not sufficient for computing Stackelberg equilibria in general-sum games.

show abstract

“…All our algorithms perform a single oracle call per round. a In the extended abstract [20], we presented the regret bound O (nm 2 √ T ) and running time O (T 2 + nmT ), corresponding to discretized level auctions with distinct thresholds (see Theorem 3.13). Here, we present the result that allows repetitions of threshold values (see Theorem 3.15).…”

Section: Main Application: Online Auction Designmentioning

confidence: 99%

“…b The regime of interest in this problem is s n. The extended abstract [20] contained a worse bound O (n 6 √ T ).…”

Section: Main Application: Online Auction Designmentioning

confidence: 99%

Oracle-efficient Online Learning and Auction Design

Dudík

Haghtalab

Luo

et al. 2020

J. ACM

Self Cite

View full text Add to dashboard Cite

We consider the design of computationally efficient online learning algorithms in an adversarial setting in which the learner has access to an offline optimization oracle. We present an algorithm called Generalized Follow-the-Perturbed-Leader and provide conditions under which it is oracle-efficient while achieving vanishing regret. Our results make significant progress on an open problem raised by Hazan and Koren [31], who showed that oracle-efficient algorithms do not exist in general [30] and asked whether one can identify properties under which oracle-efficient online learning may be possible. Our auction-design framework considers an auctioneer learning an optimal auction for a sequence of adversarially selected valuations with the goal of achieving revenue that is almost as good as the optimal auction in hindsight, among a class of auctions. We give oracle-efficient learning results for: (1) VCG auctions with bidder-specific reserves in single-parameter settings, (2) envy-free item pricing in multi-item auctions, and (3) s-level auctions of Morgenstern and Roughgarden [43] for single-item settings. The last result leads to an approximation of the overall optimal Myerson auction when bidders’ valuations are drawn according to a fast-mixing Markov process, extending prior work that only gave such guarantees for the i.i.d. setting. Finally, we derive various extensions, including: (1) oracle-efficient algorithms for the contextual learning setting in which the learner has access to side information (such as bidder demographics), (2) learning with approximate oracles such as those based on Maximal-in-Range algorithms, and (3) no-regret bidding in simultaneous auctions, resolving an open problem of Daskalakis and Syrgkanis [14].

show abstract

Oracle-Efficient Online Learning and Auction Design

Cited by 27 publications

References 37 publications

Minimizing Regret with Multiple Reserves

Minimizing Regret with Multiple Reserves

Computing Stackelberg Equilibria of Large General-Sum Games

Oracle-efficient Online Learning and Auction Design

Contact Info

Product

Resources

About