2019
DOI: 10.1007/978-3-030-37494-5_2

Reinforcement Learning Method for Ad Networks Ordering in Real-Time Bidding

Cited by 4 publications (3 citation statements)
References 21 publications
“…Advancements in DRL approaches in recent years have enabled considerable progress for the domain of COP applications [Cappart et al, 2021, Oren et al, 2021]. Some of the major COPs have been successfully solved using DRL such as the Travelling Salesman Problem (TSP) [Zhang et al, 2021, d O Costa et al, 2020, Zhang et al, 2020b], the Knap Sack Problem [Afshar et al, 2020, Cappart et al, 2021] and the Steiner Tree Problem [Du et al, 2021]. Zhang and Dietterich [1995] were able to show the potential of Reinforcement Learning (RL) for JSSPs as far back as 1995, by improving the results of the scheduling algorithm by Deale et al [1994] which used a temporal difference algorithm in combination with simulated annealing.…”
Section: Related Work (mentioning)
confidence: 99%
“…The implementation of DRL in the field of Operational Research (OR) has become quite significant. Several studies incorporating DRL to solve COP have shown promising results [Du et al, 2021, Afshar et al, 2020]. Moreover, DRL provides a significantly faster approximation for COPs compared to exhaustive search, metaheuristics, or other conventional heuristics.…”
Section: Introduction (mentioning)
confidence: 99%
“…Preliminary analysis on the relations between features like floor price and the revenue are discussed in [27]. We Fig.…”
Section: Data Description (mentioning)
confidence: 99%