2022
DOI: 10.48550/arxiv.2205.03699
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Rate-Optimal Contextual Online Matching Bandit

Abstract: Two-sided online matching platforms have been employed in various markets. However, agents' preferences in the present market are usually implicit and unknown, and thus must be learned from data. With the growing availability of side information involved in the decision process, modern online matching methodology demands the capability to track preference dynamics for agents based on the contextual information. This motivates us to consider a novel Contextual Online Matching Bandit prOblem (COMBO), which allow… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 33 publications
0
1
0
Order By: Relevance
“…Recently, there is a growing line of research in the statistics literature for policy learning and/or evaluation in infinite horizons. Some references include Chen et al (2022), Ertefaie and Strawderman (2018), Liao et al (2020), Liao et al (2021), Li et al (2022), Luckett et al (2020), Ramprasad et al (2022), Shi et al (2022, and Xu et al (2020). In the computer science literature, there is a huge literature on developing reinforcement learning (RL) algorithms in infinite horizons.…”
mentioning
confidence: 99%
“…Recently, there is a growing line of research in the statistics literature for policy learning and/or evaluation in infinite horizons. Some references include Chen et al (2022), Ertefaie and Strawderman (2018), Liao et al (2020), Liao et al (2021), Li et al (2022), Luckett et al (2020), Ramprasad et al (2022), Shi et al (2022, and Xu et al (2020). In the computer science literature, there is a huge literature on developing reinforcement learning (RL) algorithms in infinite horizons.…”
mentioning
confidence: 99%