2021
DOI: 10.48550/arxiv.2108.00490
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A survey of Monte Carlo methods for noisy and costly densities with application to reinforcement learning

Abstract: This survey gives an overview of Monte Carlo methodologies using surrogate models, for dealing with densities which are intractable, costly, and/or noisy. This type of problem can be found in numerous real-world scenarios, including stochastic optimization and reinforcement learning, where each evaluation of a density function may incur some computationally-expensive or even physical (real-world activity) cost, likely to give different results each time. The surrogate model does not incur this cost, but there … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 58 publications
0
3
0
Order By: Relevance
“…This is called the Markov property. There are three main learning methods for reinforcement learning: dynamic programming (DP) method [17], Monte Carlo (MC) method [18], and time difference (TD) method [19]. The DP method is a method of obtaining optimal behavior by solving the Bellman equation when each parameter of the system is known.…”
Section: B Reinforcement Learningmentioning
confidence: 99%
“…This is called the Markov property. There are three main learning methods for reinforcement learning: dynamic programming (DP) method [17], Monte Carlo (MC) method [18], and time difference (TD) method [19]. The DP method is a method of obtaining optimal behavior by solving the Bellman equation when each parameter of the system is known.…”
Section: B Reinforcement Learningmentioning
confidence: 99%
“…Monte Carlo sampling (see, for example, [4]) is a set of techniques that randomly generate numerical quantities for the purpose of simulating a statistical distribution or computing a moment or other expectation thereof (e.g., mean, variance). It is prominent in many disciplines, including computational finance [5], computational physics [6], artificial intelligence [7,8], and various branches of engineering [9]. Although the concepts and ideas discussed in this paper readily generalize to other disciplines, we present our exposition with a focus on computational finance.…”
Section: Introductionmentioning
confidence: 99%
“…A wide range of modern applications, especially in Bayesian inference framework [1], require the study of probability density functions (pdfs) which can be evaluated stochastically, i.e., only noisy evaluations can be obtained [2,3,4,5]. For instance, this is the case of the pseudo-marginal approaches and doubly intractable posteriors [6,7], approximate Bayesian computation (ABC) and likelihood-free schemes [8,9], where the target density cannot be computed in closed-form.…”
Section: Introductionmentioning
confidence: 99%