2012
DOI: 10.1108/17563781211208233
|View full text |Cite
|
Sign up to set email alerts
|

A version of Geiringer‐like theorem for decision making in the environments with randomness and incomplete information

Abstract: Received (?? ?? 2011) Revised (Day Month Year)Purpose-In recent years Monte-Carlo sampling methods, such as Monte Carlo tree search, have achieved tremendous success in model free reinforcement learning. A combination of the so called upper confidence bounds policy to preserve the "exploration vs. exploitation" balance to select actions for sample evaluations together with massive computing power to store and to update dynamically a rather large pre-evaluated game tree lead to the development of software that … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
50
0

Year Published

2013
2013
2013
2013

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(50 citation statements)
references
References 22 publications
0
50
0
Order By: Relevance
“…Just as in [3], a convenient way to represent a similarity relation C on a set of states S is to assign a positive integer to each similarity set O ∈ C in a one-to-one fashion. Each element of a set O labeled by an integer l is then uniquely determined by an additional alphabet symbol.…”
Section: Mathematical Framework and Notation 21 Set Cover Of The Stamentioning
confidence: 99%
See 4 more Smart Citations
“…Just as in [3], a convenient way to represent a similarity relation C on a set of states S is to assign a positive integer to each similarity set O ∈ C in a one-to-one fashion. Each element of a set O labeled by an integer l is then uniquely determined by an additional alphabet symbol.…”
Section: Mathematical Framework and Notation 21 Set Cover Of The Stamentioning
confidence: 99%
“…Each element of a set O labeled by an integer l is then uniquely determined by an additional alphabet symbol. Unlike the case in [3], it is possible for the same state to be labeled in a number of different ways as long as the corresponding integer labels differ. An example appears below.…”
Section: Mathematical Framework and Notation 21 Set Cover Of The Stamentioning
confidence: 99%
See 3 more Smart Citations