2019
DOI: 10.1287/mksc.2018.1129
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments

Abstract: Consider the pricing decision for a manager at a large online retailer, that sells millions of products.A manager must decide on real-time prices for each of these products. It is infeasible to have complete knowledge of demand curve for each product. A manager can run price experiments to learn about demand and maximize long run profits. There are two aspects that make this setting different from traditional brick-and-mortar settings. First, due to the number of products the manager must be able to automate p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
52
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 127 publications
(53 citation statements)
references
References 62 publications
0
52
0
1
Order By: Relevance
“…Feit and Berman, 2019) will have a negative impact on the company's profitability during the test period, so-called adaptive testing is recommended (cf. Misra et al, 2019). This will continuously optimize the allocation of test subjects to the test conditions in terms of effectiveness-for example, maximizing the profit or the number of conversions.…”
Section: Ijopm 415mentioning
confidence: 99%
“…Feit and Berman, 2019) will have a negative impact on the company's profitability during the test period, so-called adaptive testing is recommended (cf. Misra et al, 2019). This will continuously optimize the allocation of test subjects to the test conditions in terms of effectiveness-for example, maximizing the profit or the number of conversions.…”
Section: Ijopm 415mentioning
confidence: 99%
“…The UCB approach allows to reach advantages, e.g. in terms of pricing applications when there is a need to identify the selling price without knowing the consumer demand: Misra et al (2019) highlighted that modified UCB algorithms statistical machine learning blended with partial identification of consumer demand, from economic theory, allowed to faster define the optimal price than other standard methods; Trov o et al (2015) with a modified UCB provide empirical evidence that proposed variations speed up the learning process in pricing applications. Generally, a modified UCB approach can significantly strike a balance between exploration and exploitation (Chen et al, 2020;Hussain and Michelusi, 2019;Xing et al, 2014).…”
Section: Multi-armed Bandit and Upper Confidence Bound Bandit Algorithmsmentioning
confidence: 99%
“…Chick and Inoue (2001) analyze a two-stage decision problem where the cost of the test is a fixed multiple of the sample sizes, rather than actual opportunity cost as we have here. In studying multi-armed bandits Schwartz et al (2017) and Misra et al (2019) use a test & roll as a benchmark, but they do not optimize the sample size. The closest work comes from the clinical trials literature, where Cheng et al (2003) defines the same test & roll problem with a finite "patient horizon" and approximates the optimal sample size for Bernoulli responses with beta priors.…”
Section: The Test and Roll Decision Problemmentioning
confidence: 99%