2023
DOI: 10.1287/mnsc.2022.4558
|View full text |Cite
|
Sign up to set email alerts
|

Online Learning via Offline Greedy Algorithms: Applications in Market Design and Optimization

Abstract: Motivated by online decision making in time-varying combinatorial environments, we study the problem of transforming offline algorithms to their online counterparts. We focus on offline combinatorial problems that are amenable to a constant factor approximation using a greedy algorithm that is robust to local errors. For such problems, we provide a general framework that efficiently transforms offline robust greedy algorithms to online ones using Blackwell approachability. We show that the resulting online alg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 27 publications
0
7
0
Order By: Relevance
“…Adversarial CMAB The closest related works are on adversarial CMAB. In (Niazadeh et al, 2021), the authors propose a framework for transforming greedy α-approximation algorithms for offline problems to online methods in an adversarial bandit setting, for both semi-bandit (achieving O(T 1/2 ) α−regret) and full-bandit feedback (achieving O(T 2/3 ) α−regret). Their framework requires the offline approximation algorithm to have an iterative greedy structure (unlike ours), satisfy a robustness property (like ours), and satisfy a property referred to as Blackwell reducibility (unlike ours).…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Adversarial CMAB The closest related works are on adversarial CMAB. In (Niazadeh et al, 2021), the authors propose a framework for transforming greedy α-approximation algorithms for offline problems to online methods in an adversarial bandit setting, for both semi-bandit (achieving O(T 1/2 ) α−regret) and full-bandit feedback (achieving O(T 2/3 ) α−regret). Their framework requires the offline approximation algorithm to have an iterative greedy structure (unlike ours), satisfy a robustness property (like ours), and satisfy a property referred to as Blackwell reducibility (unlike ours).…”
Section: Related Workmentioning
confidence: 99%
“…We also note that (Niazadeh et al, 2021) do not consider submodular CMAB with knapsack constraints, and thus do not verify whether any approximation algorithms for the offline problem satisfy the required properties (of sub-problem structure or robustness or Blackwell reducibility) to be transformed, and this is an example we consider for our general framework. Consequently, in our experiments for submodular CMAB with knapsack constraints in Section 7, we use the algorithm in (Streeter and Golovin, 2008) designed for a knapsack constraint (in expectation) as representative of methods for the adversarial setting.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Many works [5,9,11,10,24] have been proposed to deal with demand learning or price experimentation problems with the help of the framework in the field of revenue management. Recently, more methods [40,17,13,14,6,44] adopt similar techniques in the product ranking setting, which inspires the algorithm proposed in this paper.…”
Section: Related Workmentioning
confidence: 99%