2012 IEEE 11th International Conference on Cognitive Informatics and Cognitive Computing 2012
DOI: 10.1109/icci-cc.2012.6311198
|View full text |Cite
|
Sign up to set email alerts
|

A new learning automaton for interaction with triple level environments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…Generally speaking, the relationship between LA and SPL has been discussed in [29], which pointed out that SPL actually generalized LA strategy for the situation that the optimal action is selected from an infinite set: LA tries to learn the optimal action, defined as what the action can maximize the reward received from the environment, while in the SPL problem the learning mechanism is trying to locate an unknown point on a real interval. The above investigations can also be applied to distinguish the algorithm proposed in this paper from that in [30] and [31]. Although both of them try to interact with the environment with three responses, their corresponding environments and purposes are different.…”
Section: B Triple Level Stochastic Point Location Problemmentioning
confidence: 99%
“…Generally speaking, the relationship between LA and SPL has been discussed in [29], which pointed out that SPL actually generalized LA strategy for the situation that the optimal action is selected from an infinite set: LA tries to learn the optimal action, defined as what the action can maximize the reward received from the environment, while in the SPL problem the learning mechanism is trying to locate an unknown point on a real interval. The above investigations can also be applied to distinguish the algorithm proposed in this paper from that in [30] and [31]. Although both of them try to interact with the environment with three responses, their corresponding environments and purposes are different.…”
Section: B Triple Level Stochastic Point Location Problemmentioning
confidence: 99%