2017
DOI: 10.1287/mksc.2016.1023
|View full text |Cite
|
Sign up to set email alerts
|

Customer Acquisition via Display Advertising Using Multi-Armed Bandit Experiments

Abstract: Firms using online advertising regularly run experiments with multiple versions of their ads since they are uncertain about which ones are most effective. During a campaign, firms try to adapt to intermediate results of their tests, optimizing what they earn while learning about their ads. Yet how should they decide what percentage of impressions to allocate to each ad? This paper answers that question, resolving the well-known "learn-and-earn" trade-off using multi-armed bandit (MAB) methods. The online adver… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
46
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 199 publications
(46 citation statements)
references
References 52 publications
0
46
0
Order By: Relevance
“…It then becomes of interest to study policies which prescribe actions that maximize the reward over all interactions (e.g., max P t ¼ T t ¼ 1 R t jA t ; X t ). There is a broad literature on the topic (see, e.g., Berry and Fristedt, 1985;Audibert et al, 2009;Scott, 2010), and within the marketing literature researchers are already approaching the personalization problem as a contextual bandit problem (Hauser et al, 2009(Hauser et al, , 2014Schwartz et al, 2013). Appreciation of the inherent uncertainty in the coupling between user, message content, and observed behavior by exploring different policies is, in our view, a key next step for the development of personalized persuasive systems.…”
Section: Exploration Versus Exploitation In Personalizationmentioning
confidence: 99%
“…It then becomes of interest to study policies which prescribe actions that maximize the reward over all interactions (e.g., max P t ¼ T t ¼ 1 R t jA t ; X t ). There is a broad literature on the topic (see, e.g., Berry and Fristedt, 1985;Audibert et al, 2009;Scott, 2010), and within the marketing literature researchers are already approaching the personalization problem as a contextual bandit problem (Hauser et al, 2009(Hauser et al, , 2014Schwartz et al, 2013). Appreciation of the inherent uncertainty in the coupling between user, message content, and observed behavior by exploring different policies is, in our view, a key next step for the development of personalized persuasive systems.…”
Section: Exploration Versus Exploitation In Personalizationmentioning
confidence: 99%
“…Another combines methods from cloud computing, machine learning, and text mining to demonstrate how online social platforms such as Twitter can be used for sales forecasting (Liu et al 2016). A fifth paper that will appear in a subsequent issue, uses methods from multi-armed bandit problems to identify the banner-advertising characteristics that are most likely to appeal to consumers (Schwartz et al 2016).…”
mentioning
confidence: 99%
“…Another stream proposes to select a query that minimizes regret or cost, when making decisions under strict uncertainty (e.g., Wang and Boutilier 2003). An independent yet related methodology is the multi-armed bandit (MAB), which simultaneously considers maximum performance and minimum regret (Schwartz et al 2016). It is widely used in online field experiments for digital content optimization (e.g., Brezzi and Lai 2002, Scott 2010, Schwartz et al 2016 to balance between profits during the experimental phase and the optimal outcome.…”
Section: Relationship To Prior Literaturementioning
confidence: 99%