2018 International Joint Conference on Neural Networks (IJCNN) 2018
DOI: 10.1109/ijcnn.2018.8489092
|View full text |Cite
|
Sign up to set email alerts
|

Targeting Optimization for Internet Advertising by Learning from Logged Bandit Feedback

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 13 publications
(7 citation statements)
references
References 11 publications
0
7
0
Order By: Relevance
“…From an application point of view, only a few works are present to deal with regret minimization and best arm identification for specific SPNB scenarios. In the Internet advertising management field, a method to select the most profitable time slot during the day has been presented in [12]. Nonetheless, this method provides suggestions in an offline fashion, exploiting the information provided from historical data, not including any procedure to include a newly discovered piece of information.…”
Section: Related Workmentioning
confidence: 99%
“…From an application point of view, only a few works are present to deal with regret minimization and best arm identification for specific SPNB scenarios. In the Internet advertising management field, a method to select the most profitable time slot during the day has been presented in [12]. Nonetheless, this method provides suggestions in an offline fashion, exploiting the information provided from historical data, not including any procedure to include a newly discovered piece of information.…”
Section: Related Workmentioning
confidence: 99%
“…The campaigns distinguish for the ad, channel, and target. In real-world applications, the set of campaigns and the spending plan can be optimized from data, e.g., by setting up campaigns with specific targets and adopting a different cumulative daily budget for every day [31]. 5 For the sake of presentation, from now on, we set one day as the unitary temporal step of our algorithms.…”
Section: Optimization Problem Formulationmentioning
confidence: 99%
“…The derivatives of loss function L(θ as ) w.r.t. parameters θ as can be presented as: ∇ θ as L(θ as ) = y as t − Q(s t , a as t ; θ as ) ∇ θ as Q(s t , a as t ; θ as ) (13) where y as t = r t (s t , a as t ) + γQ * s t +1 , BS Q * (s t +1 , A as t +1 ; θ as T ) ; θ as T . The operation Q(s t , A as t ) looks through the candidate ad set {a ad t +1 } and all locations {a loc t +1 } (including the location of inserting no ad).…”
Section: The Optimization Taskmentioning
confidence: 99%
“…The second group is real-time bidding (RTB), which allows an advertiser to bid each ad impression in a very short time slot. Ad selection task is typically modeled as multi-armed bandit problem supposing that arms are iid, feedback is immediate and environments are stationary [13,19,23,25,31,32]. The problem of online advertising with budget constraints and variable costs is studied in MAB setting [10], where pulling the arms of bandit results in random rewards and spends random costs.…”
Section: Related Workmentioning
confidence: 99%