2021
DOI: 10.1287/opre.2020.2013
|View full text |Cite
|
Sign up to set email alerts
|

Matching While Learning

Abstract: Platforms face a cold start problem whenever new users arrive: namely, the platform must learn attributes of new users (explore) in order to match them better in the future (exploit). How should a platform handle cold starts when there are limited quantities of the items being recommended? For instance, how should a labor market platform match workers to jobs over the lifetime of the worker, given a limited supply of jobs? In this setting, there is one multiarmed bandit problem for each worker, coupled togethe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 33 publications
(15 citation statements)
references
References 28 publications
0
15
0
Order By: Relevance
“…Many of these papers consider a multi-armed bandit setting where rewards are stochastic and redrawn i.i.d. from an unknown distribution, with algorithms of an explore-exploit nature [13,26,31]. Efficient algorithms that approximate the optimal online algorithm were also studied by [41] in a stochastic setting with a price for querying each edge, relying on a Gittins Index characterization for the optimal online algorithm [15,42].…”
Section: Relationship With Prior Workmentioning
confidence: 99%
“…Many of these papers consider a multi-armed bandit setting where rewards are stochastic and redrawn i.i.d. from an unknown distribution, with algorithms of an explore-exploit nature [13,26,31]. Efficient algorithms that approximate the optimal online algorithm were also studied by [41] in a stochastic setting with a price for querying each edge, relying on a Gittins Index characterization for the optimal online algorithm [15,42].…”
Section: Relationship With Prior Workmentioning
confidence: 99%
“…As a generalization of a paired kidney exchange market, the dynamic matching problem was extended to finding disjoint 3-way circles and chains [19,3,6,9,34]. Anderson et al [5] and Ashlagi et al [8] analyzed the expected waiting time in the market.…”
Section: Stochastic Matching Marketmentioning
confidence: 99%
“…Matching Market with Departures A matching market where each agent is allowed to leave has been studied in various settings. Johari et al [19] and Ashlagi et al [7] studied a matching model where agents would depart after a constant time after arrival. Akbarpour et al [4] introduced a dynamic model where each vertex arrives and departs stochastically on a general (i.e., non-bipartite) network.…”
Section: Stochastic Matching Marketmentioning
confidence: 99%
“…TaskRabbit, UpWork, DoorDash. An emerging line of research [1,16,23,27,24] in the field of multi-agent bandits is dedicated to understanding algorithmic principles in the interplay of competition, learning and regret minimization. The two-sided matching market [14] is one such thread, where regret minimization is first studied in [23] with a centralized arbiter, and in [27,24] at different levels of decentralization.…”
Section: Introductionmentioning
confidence: 99%