2014 IEEE 7th Joint International Information Technology and Artificial Intelligence Conference 2014
DOI: 10.1109/itaic.2014.7065060
|View full text |Cite
|
Sign up to set email alerts
|

Maximal expectation as upper confidence bound for multi-armed bandit problems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 9 publications
0
0
0
Order By: Relevance
“…For example, Chen proposed the UCB-max algorithm, Roy referred to the UCB-KL algorithm and their introduction of the TV-KL-UCB algorithm, and Gil. proposed the UCB-RAD auxiliary algorithm, among others [4][5][6]. Considering the potential strong fluctuations in video rating data during execution, this paper also adopts the AUCB (Asymptotically Optimal UCB) algorithm to more accurately estimate the uncertainty of actions.…”
Section: Upper Confidence Bound (Ucb)mentioning
confidence: 99%
“…For example, Chen proposed the UCB-max algorithm, Roy referred to the UCB-KL algorithm and their introduction of the TV-KL-UCB algorithm, and Gil. proposed the UCB-RAD auxiliary algorithm, among others [4][5][6]. Considering the potential strong fluctuations in video rating data during execution, this paper also adopts the AUCB (Asymptotically Optimal UCB) algorithm to more accurately estimate the uncertainty of actions.…”
Section: Upper Confidence Bound (Ucb)mentioning
confidence: 99%