Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining 2022
DOI: 10.1145/3488560.3498396
|View full text |Cite
|
Sign up to set email alerts
|

Non-stationary Continuum-armed Bandits for Online Hyperparameter Optimization

Abstract: For years, machine learning has become the dominant approach to a variety of information retrieval tasks. The performance of machine learning algorithms heavily depends on their hyperparameters. It is hence critical to identity the optimal hyperparameter configuration when applying machine learning algorithms. Most of existing hyperparameter optimization methods assume a static relationship between hyperparameter configuration and algorithmic performance and are thus not suitable for many information retrieval… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 33 publications
0
0
0
Order By: Relevance
“…Non-stationary multi-armed bandits have attracted intensive attention in the past years, both from the theory [9]- [13] and the application [14]- [17] side. Potential application domains span across different fields, including online recommender systems [3], [15]- [17], hyperparameter optimization [18], virtual reality for rehabilitation [19], split liver transplantation allocation [20], evaluation of information retrieval systems [21], or targeted Covid-19 border testing of travelers [22]. The state-of-the-art methods in non-stationary bandits either do not consider access to contextual information or do not assume costly information acquisition.…”
Section: A Related Workmentioning
confidence: 99%
“…Non-stationary multi-armed bandits have attracted intensive attention in the past years, both from the theory [9]- [13] and the application [14]- [17] side. Potential application domains span across different fields, including online recommender systems [3], [15]- [17], hyperparameter optimization [18], virtual reality for rehabilitation [19], split liver transplantation allocation [20], evaluation of information retrieval systems [21], or targeted Covid-19 border testing of travelers [22]. The state-of-the-art methods in non-stationary bandits either do not consider access to contextual information or do not assume costly information acquisition.…”
Section: A Related Workmentioning
confidence: 99%
“…Currently, many efforts have been devoted to dealing with sequential decision-making problems using bandit-based approaches, where the decision maker seeks to select the action with the highest expected reward among action candidates under uncertainty [17]. Applications have been applied to several fields, for instance, recommendation systems [18], information retrieval [19], healthcare [20], etc. Several current pieces of research about the application of bandit-based approaches to the energy resource management problem are provided in Table 1.…”
Section: Literature Review and Research Gapmentioning
confidence: 99%
“…Since the predicted load and demand response requirement have uncertainty, some DG, GT, and curtailable load capacities are spared to improve the reliability with a certain probability. The chance constraints are formulated in (19) and (20). δ load t , δ DR t are the corresponding random variables of the estimation errors and α is the risk parameter.…”
Section: Gas Turbinementioning
confidence: 99%
See 1 more Smart Citation