Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569)
DOI: 10.1109/nafips.2001.944312
|View full text |Cite
|
Sign up to set email alerts
|

Dual heuristic programming for fuzzy control

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(10 citation statements)
references
References 16 publications
0
10
0
Order By: Relevance
“…In Dual Heuristic Programming (DHP), critic's output is a derivative of the value function with respect to the state. DHP has been shown to converge to the optimal solution more rapidly than HDP [13], but use of derivatives leads to a complex relationship for updating control and value functional derivatives and poses a function approximation challenge. To overcome these difficulties, Global Dual Heuristic Programming (GDHP) has been proposed that combines advantages of both DHP and HDP [14].…”
Section: Model Based Rlmentioning
confidence: 99%
See 1 more Smart Citation
“…In Dual Heuristic Programming (DHP), critic's output is a derivative of the value function with respect to the state. DHP has been shown to converge to the optimal solution more rapidly than HDP [13], but use of derivatives leads to a complex relationship for updating control and value functional derivatives and poses a function approximation challenge. To overcome these difficulties, Global Dual Heuristic Programming (GDHP) has been proposed that combines advantages of both DHP and HDP [14].…”
Section: Model Based Rlmentioning
confidence: 99%
“…Design difficulties of Markov game based control 1. A nontrivial design consideration is the use of LP in the proposed approach for solving the control-disturber game to update the Q-values (13). In general, LP converges slowly which implies that computational complexity of the approach would increase with the dimensionality of the state-action space.…”
Section: Game Theory Based Solution To Dec-pomdpmentioning
confidence: 99%
“…A family of ADP structures was proposed by Werbos in the early 1990's [68], [69], and has been widely used by others [16], [17], [18], [19], [20], [23], [24], [25], [26], [27], [28], [36], [45], [46], [47], [51], [53], [54]. While the original formulation was based on neural network implementations, it was noted that any learning structure capable of implementing the appropriate mathematics would work.…”
Section: Adaptive Critics To Adpmentioning
confidence: 99%
“…While the original formulation was based on neural network implementations, it was noted that any learning structure capable of implementing the appropriate mathematics would work. Fuzzy Logic structures would be a case in point; examples may be found in [27], [52], [57], [61], [63]. Werbos' family (also called "ladder") of ADP structures includes: Heuristic Dynamic Programming (HDP), Dual Heuristic Programming (DHP), and Global Dual Heuristic Programming (GDHP).…”
Section: Adaptive Critics To Adpmentioning
confidence: 99%
“…Simple gradient descent was used to train the critic and controller consequent parameters only (see [6] [11] for details on the DHP algorithm and training equations for fuzzy structures). Perturbation through the plant model was used to obtain plant derivatives.…”
Section: Dhp Setup and Training Processmentioning
confidence: 99%