Performance Effectiveness of Multimedia Information Search Using the Epsilon-Greedy Algorithm

Kuang, Nikki Lijing; Leung, Clement H. C.

doi:10.1109/icmla.2019.00160

Cited by 10 publications

(6 citation statements)

References 38 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Including improved algorithms, this experiment employed six different algorithms. Based on Mambou's understanding of the Epsilon algorithm, we set it accordingly [9]. B was set to 4.4 based on the distribution of the data.…”

Section: Prediction Of Resultsmentioning

confidence: 99%

Optimizing video click-through rates with bandit algorithms

Liu

2024

ACE

View full text Add to dashboard Cite

In recent years, videos have increasingly influenced public perception, making video platforms a focal point of digital consumption. One critical challenge for platform operators is identifying videos that resonate most with users, as user ratings directly reflect viewer preferences and experiences. This study explores the use of bandit algorithms to predict and strategize the overall ratings of various anime videos on the Bilibili platform. Bandit algorithms, a subset of the multi-armed bandit model, dynamically adjust selection strategies based on prior feedback to maximize cumulative rewards. Our empirical research assessed multiple gambling algorithms, including the -greedy, Upper Confidence Bound (UCB), Explore-then-Commit (ETC), and Thompson Sampling (TS) algorithms. The findings indicate that the Thompson Sampling algorithm, in particular, achieved the lowest cumulative regret in selecting optimal videos on the Bilibili platform, showcasing its superior performance. This study highlights the potential of bandit algorithms to enhance video selection processes, ensuring that platforms can effectively cater to user preferences and enhance viewer satisfaction.

show abstract

Section: Prediction Of Resultsmentioning

confidence: 99%

Optimizing video click-through rates with bandit algorithms

Liu

2024

ACE

View full text Add to dashboard Cite

show abstract

“…But, with the introduction of certain level of randomness, the agent, even after having found a solution, will continue to look for other solutions. In epsilon-greedy method, the agent will perform random actions if it satisfies a certain condition [12].…”

Section: E Epsilon-greedy Explorationmentioning

confidence: 99%

“…The Epsilon-Greed Action Selection was then introduced to allow the agent to continue exploration despite having found the solution in the work of Michael Wunder; et al, [12]. It shows how the epsilon-greedy exploration yields higher-than-Nash outcomes.…”

Section: Literature Surveymentioning

confidence: 99%

A Brief Study of Deep Reinforcement Learning with Epsilon-Greedy Exploration

Hariharan¹,

Anand²

2022

IJCDS

View full text Add to dashboard Cite

This paper analyses a simple epsilon-greedy exploration approach to train models with Deep Q-Learning algorithm to involve randomness that helps prevail the agent over conforming to a single solution. This allows the agent to explore different solutions for a problem even after finding a solution. This helps the agent find the global optimum solution without being stuck in a local optimum. A simple block environment is built and used to assess the agent's ability to reach the destination, block A to reach block B. The model is trained repeatedly by feeding the game image and rewarding it based on the decisions made. The weights of the Neural Network of the Reinforcement Learning model are then adjusted by training the model after every iteration to improve the result. Furthermore, two different environments from the Gym library in Python is used to corroborate the results obtained. Here we have used TensorFlow to build and implement the model on the GPU for better and accelerated computation.

show abstract

“…The Epsilon-greedy method combines the random algorithm and the greedy algorithm to deal with the exploration and exploitation dilemma [16,28]. The main idea of the Epsilon-greedy is to control the utilization rate of the greedy algorithm or the random algorithm through a small probability (smaller than 1) with the aim to make the behavior of the Epsilon-greedy to be greedy most of the time, but random once in a while.…”

Section: Epsilon-greedymentioning

confidence: 99%

A multi-objective hyper-heuristic algorithm based on adaptive epsilon-greedy selection

Yang

Zhang

2021

Complex Intell. Syst.

View full text Add to dashboard Cite

A variety of meta-heuristics have shown promising performance for solving multi-objective optimization problems (MOPs). However, existing meta-heuristics may have the best performance on particular MOPs, but may not perform well on the other MOPs. To improve the cross-domain ability, this paper presents a multi-objective hyper-heuristic algorithm based on adaptive epsilon-greedy selection (HH_EG) for solving MOPs. To select and combine low-level heuristics (LLHs) during the evolutionary procedure, this paper also proposes an adaptive epsilon-greedy selection strategy. The proposed hyper-heuristic can solve problems from varied domains by simply changing LLHs without redesigning the high-level strategy. Meanwhile, HH_EG does not need to tune parameters, and is easy to be integrated with various performance indicators. We test HH_EG on the classical DTLZ test suite, the IMOP test suite, the many-objective MaF test suite, and a test suite of a real-world multi-objective problem. Experimental results show the effectiveness of HH_EG in combining the advantages of each LLH and solving cross-domain problems.

show abstract

Performance Effectiveness of Multimedia Information Search Using the Epsilon-Greedy Algorithm

Cited by 10 publications

References 38 publications

Optimizing video click-through rates with bandit algorithms

Optimizing video click-through rates with bandit algorithms

A Brief Study of Deep Reinforcement Learning with Epsilon-Greedy Exploration

A multi-objective hyper-heuristic algorithm based on adaptive epsilon-greedy selection

Contact Info

Product

Resources

About