2016 3rd Conference on Control and Fault-Tolerant Systems (SysTol) 2016
DOI: 10.1109/systol.2016.7739765
|View full text |Cite
|
Sign up to set email alerts
|

Temporal-difference Q-learning in active fault diagnosis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 20 publications
0
6
0
Order By: Relevance
“…There are two types of the tasks involved in RL. One is 'prediction,' and the other is 'control' [131]. The prediction task predicts the total expected reward from any given state, assuming function policy π(a|s) is given.…”
Section: Figure 210 the Rl Modelmentioning
confidence: 99%
“…There are two types of the tasks involved in RL. One is 'prediction,' and the other is 'control' [131]. The prediction task predicts the total expected reward from any given state, assuming function policy π(a|s) is given.…”
Section: Figure 210 the Rl Modelmentioning
confidence: 99%
“…when s ̸ = 0, the identifier (10) represents the fault identifier, and when s = 0, the identifier (10) represents the normal identifier. Compare (10) with (9), and define…”
Section: Dlt-based System Identificationmentioning
confidence: 99%
“…The critic NN is applied to approximate the strategic utility function, that is, to estimate the long-term system performance [18] [19]. Interesting results about the ADP/actor-critic methods for the dynamical system have been reported in the recent literatures, and have achieved excellent performance in FD [10]- [13]. For example, in [10], inspired from the ADP techniques, an active fault detector is designed for a class of discrete-time system, where a reward function is set to impose the penalty of the wrong fault detection decisions.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Skach et al [12], have utilized the RL method for fault detection and isolation in a system with the uncertainty. Yih et al [13] have combined the RL and Fuzzy Logic to detect system faults.…”
Section: Introductionmentioning
confidence: 99%