2006 International Workshop on Integrating AI and Data Mining
DOI: 10.1109/aidm.2006.4

Dynamic Algorithm Selection Using Reinforcement Learning

Abstract: It is often the case that many algorithms exist to solve a single problem, each possessing different performance characteristics. The usual approach in this situation is to manually select the algorithm which has the best average performance. However, this strategy has drawbacks in cases where the optimal algorithm changes during an invocation of the program, in response to changes in the program's state and the computational environment. This paper presents a prototype tool that uses reinforcement learning to…

Cited by 20 publications (21 citation statements, published 2007–2024); references 10 publications.

Citation statements (ordered by relevance):
“…Along these lines, Armstrong et al. [1] introduce a general scheme for adaptive algorithms that employs reinforcement learning in the same manner: an 'optimization system' collects sensor data (i.e., features of the input data), invokes a reinforcement learning algorithm, and executes its decisions (i.e., which algorithm to use). The (negated) execution time of the selected algorithm serves as reward.…”
Section: Background and Related Work
confidence: 99%
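To make the scheme concrete, here is a minimal sketch of that sense-decide-execute loop. It is an illustration, not the authors' implementation: the `algorithms` list, the `sense` feature extractor, and the problem stream are hypothetical placeholders, and a tabular estimate with an epsilon-greedy policy stands in for the unspecified reinforcement learner. The reward is the negated execution time, as in the statement above.

    import random
    import time
    from collections import defaultdict

    def rl_algorithm_selector(problems, algorithms, sense, alpha=0.1, epsilon=0.1):
        # Q[state][i] is a running estimate of the (negated) runtime of
        # algorithm i when the sensed features equal `state`.
        Q = defaultdict(lambda: [0.0] * len(algorithms))
        log = []
        for problem in problems:
            state = sense(problem)                   # sensor data -> discrete state
            if random.random() < epsilon:            # explore occasionally
                choice = random.randrange(len(algorithms))
            else:                                    # otherwise exploit estimates
                choice = max(range(len(algorithms)), key=lambda a: Q[state][a])
            start = time.perf_counter()
            algorithms[choice](problem)              # execute the selected algorithm
            reward = -(time.perf_counter() - start)  # negated execution time
            Q[state][choice] += alpha * (reward - Q[state][choice])
            log.append((state, choice, reward))
        return Q, log

Each decision updates a per-state running estimate of how fast each algorithm is, so the selector gradually shifts toward the algorithm that is cheapest for the features it currently observes.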
“…To do so, we follow [1] and introduce a preprocessor function p : Σ* → S that aggregates a trajectory τ ∈ Σ* to a state s ∈ S. As not all states of the environment are distinguishable anymore, we now deal with a partially observable Markov decision process.…”
Section: A Generic Adaptive Simulator
confidence: 99%
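A hedged sketch of one possible preprocessor p : Σ* → S follows; the aggregation below (a length bucket plus the most recent symbol) is an invented example, since the statement does not prescribe a particular summary:

    def preprocess(trajectory):
        # p : Sigma* -> S: compress a variable-length trajectory into a
        # small, hashable state. Bucketing the length keeps the state
        # space finite, which tabular learners require.
        if not trajectory:
            return (0, None)
        return (min(len(trajectory) // 10, 5), trajectory[-1])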
“…[14,15] propose 50% for the best solver, 25% for the second best, and so on. Some selection solvers [15,2] do not need a separate training phase and perform solver selection entirely online; a weakness of this approach is that it is only possible when a large enough budget is available, so that the training phase has a minor cost. At the moment, the case of portfolios of noisy optimization solvers has not been discussed in the literature.…”
Section: Algorithm Selection
confidence: 99%
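The halving allocation mentioned in [14,15] is easy to state in code. In this sketch the ranking is assumed to be given, and folding the geometric tail into the last solver (so the weights sum to one) is our assumption about how the remainder is handled:

    def halving_weights(ranked_solvers):
        # Best solver gets 1/2, second best 1/4, and so on; the leftover
        # geometric tail (1/2^n) is folded into the last solver so the
        # probabilities sum to exactly one.
        weights = [2.0 ** -(k + 1) for k in range(len(ranked_solvers))]
        weights[-1] *= 2
        return dict(zip(ranked_solvers, weights))

    print(halving_weights(["A", "B", "C"]))  # {'A': 0.5, 'B': 0.25, 'C': 0.25}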
“…This is a typical online model selection problem, for which many algorithms have been developed and applied in a wide variety of algorithm-problem settings. In [13,1], for instance, model selection is formulated as a Markov Decision Process, and reinforcement learning is used to find the optimal algorithm scheduling strategy. In [9,6], multi-armed bandit algorithms are employed to find the optimal assignment of available computational resources.…”
Section: Introduction
confidence: 99%
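As an illustration of the bandit approach, here is a minimal UCB1 loop over a portfolio. The `evaluate` callback (returning a reward in [0, 1] for one run of an arm) is a hypothetical placeholder, and UCB1 is just one representative of the bandit algorithms the cited works could be using:

    import math

    def ucb1_select(arms, evaluate, budget):
        # Play each arm once, then repeatedly pick the arm maximizing the
        # UCB1 index: empirical mean + sqrt(2 ln t / n_arm).
        counts = [0] * len(arms)
        means = [0.0] * len(arms)
        for t in range(1, budget + 1):
            if t <= len(arms):
                i = t - 1
            else:
                i = max(range(len(arms)),
                        key=lambda a: means[a] + math.sqrt(2 * math.log(t) / counts[a]))
            r = evaluate(arms[i])        # reward in [0, 1] for one run of arm i
            counts[i] += 1
            means[i] += (r - means[i]) / counts[i]
        return arms[max(range(len(arms)), key=lambda a: means[a])]

With a budget much larger than the number of arms, most evaluations concentrate on the empirically best solver, which is how such schemes assign computational resources.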