2014 IEEE Congress on Evolutionary Computation (CEC) 2014
DOI: 10.1109/cec.2014.6900505
|View full text |Cite
|
Sign up to set email alerts
|

Differential Evolution algorithm applied to non-stationary bandit problem

Abstract: In this paper we compare Differential Evolution (DE), an evolutionary algorithm, to classical bandit algorithms over the non-stationary bandit problem. First we define a testcase where the variation of the distributions depends on the number of times an option is evaluated rather than over time. This definition allows the possibility to apply these algorithms over a wide range of problems such as black-box portfolio selection. Second we present our own variant of discounted Upper Confidence Bound (UCB) algorit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
3
3

Relationship

3
3

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 21 publications
0
5
0
Order By: Relevance
“…Mellor and Shapiro (2013) analyze an NS-MAB where the probabilities according to which the expected value of the arms change are a priori fixed and propose the CTS algorithm that combines Thompson Sampling with a change point detection mechanism. St-Pierre and Jialin (2014) present an evolutionary algorithm to deal with generic non-stationary environments which empirically outperforms classical solutions.…”
Section: Related Workmentioning
confidence: 99%
“…Mellor and Shapiro (2013) analyze an NS-MAB where the probabilities according to which the expected value of the arms change are a priori fixed and propose the CTS algorithm that combines Thompson Sampling with a change point detection mechanism. St-Pierre and Jialin (2014) present an evolutionary algorithm to deal with generic non-stationary environments which empirically outperforms classical solutions.…”
Section: Related Workmentioning
confidence: 99%
“…The results show that the UCB algorithm is efficient in algorithm selection problems. Also in [18], authors regarded the algorithm selection problem as a nonstationary bandit problem and applied UCB algorithm to be the decision policy.…”
Section: Multi-armed Bandit Problemmentioning
confidence: 99%
“…In spite of some adaptations to other contexts (time varying as in [26] or adversarial [21,7]), and maybe due to strong differences such as the very non-stationary nature of bandit problems involved in optimization portfolios, these methods did not, for the moment, really find their way to AS. Another approach consists in writing this bandit algorithm as a meta-optimization problem; [38] applies the differential evolution algorithm [39] to some non-stationary bandit problem, which outperforms the classical bandit algorithm on an AS task.…”
Section: Static Portfolios and Parameter Tuningmentioning
confidence: 99%