1989
DOI: 10.1109/9.24227
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive aggregation methods for infinite horizon dynamic programming

Abstract: In this paper we propose an on-line policy iteration (PI) algorithm for finite-state infinite horizon discounted dynamic programming, whereby the policy improvement operation is done on-line, only for the states that are encountered during operation of the system. This allows the continuous updating/improvement of the current policy, thus resulting in a form of on-line PI that incorporates the improved controls into the current policy as new states and controls are generated. The algorithm converges in a finit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
100
0

Year Published

1994
1994
2011
2011

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 140 publications
(100 citation statements)
references
References 18 publications
0
100
0
Order By: Relevance
“…Bertsekas and Castafion (1989) developed an adaptive aggregation scheme for use with the policy iteration algorithm. Rather than relying on feature extraction, this approach automatically and adaptively aggregates states during the course of an algorithm based on probability transition matrices under greedy policies.…”
Section: Related Workmentioning
confidence: 99%
“…Bertsekas and Castafion (1989) developed an adaptive aggregation scheme for use with the policy iteration algorithm. Rather than relying on feature extraction, this approach automatically and adaptively aggregates states during the course of an algorithm based on probability transition matrices under greedy policies.…”
Section: Related Workmentioning
confidence: 99%
“…The aggregation is done in order to reduce the size of the state space. Some algorithms of this type include those in Hinderer (1978), Mendelssohn (1982), Bean et al (1987) (which is for deterministic dynamic programs only), and Bertsekas and Castanon (1989). Morin (1978) is a general survey paper of the older literature.…”
Section: Hierarchical Aggregation For Problems With Large Attribute Smentioning
confidence: 99%
“…The problem with aggregation is that the right level changes with the number of times you observe a set of states. Some authors have suggested changing the level of aggregation with the number of iterations (Bertsekas and Castanon (1989), Luus (2000)). …”
Section: Multilevel Aggregationmentioning
confidence: 99%