Proceedings of the Fifth International Joint Conference on Autonomous Agents and Multiagent Systems 2006
DOI: 10.1145/1160633.1160779
|View full text |Cite
|
Sign up to set email alerts
|

Improving reinforcement learning with context detection

Abstract: In this paper we propose a method for solving reinforcement learning problems in non-stationary environments. The basic idea is to create and simultaneously update multiple partial models of the environment dynamics. The learning mechanism is based on the detection of context changes, that is, on the detection of significant changes in the dynamics of the environment. Based on this motivation, we propose, formalize and show the efficiency of a method for detecting the current context and the associated model o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2006
2006
2023
2023

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 4 publications
0
8
0
Order By: Relevance
“…It is the trade-off of the two extremes (centralised and decentralised) formally called as DCEE framework [45]. The adjacent agents information are coordinated for better traffic flows in [21,23,[27][28][29]. Only the downstream information are added in [21,28,29] to create the state for action prediction.…”
Section: Coordination Of Intersectionsmentioning
confidence: 99%
See 2 more Smart Citations
“…It is the trade-off of the two extremes (centralised and decentralised) formally called as DCEE framework [45]. The adjacent agents information are coordinated for better traffic flows in [21,23,[27][28][29]. Only the downstream information are added in [21,28,29] to create the state for action prediction.…”
Section: Coordination Of Intersectionsmentioning
confidence: 99%
“…The adjacent agents information are coordinated for better traffic flows in [21,23,[27][28][29]. Only the downstream information are added in [21,28,29] to create the state for action prediction. All neighbouring agents' information are added in [22,23] and neighbours hidden states are added in [46].…”
Section: Coordination Of Intersectionsmentioning
confidence: 99%
See 1 more Smart Citation
“…In [19] intelligent traffic lights learn via reinforcement learning the optimal signal plan, and also adapt to changes in traffic patterns by a meaning of context detection.…”
Section: Related Workmentioning
confidence: 99%
“…Based on this hypothesis, we propose, formalize and show the efficiency of a method called RL-CD, or Reinforcement Learning with Context Detection, which performs well in non-stationary environments by continuously evaluating the prediction quality of each partial model. A brief discussion of how to measure prediction quality, in order to deal with non-stationary environments, can also be found in (Silva et al, 2006). This paper is organized as follows.…”
Section: Introductionmentioning
confidence: 99%