2005
DOI: 10.1007/11427469_34
|View full text |Cite
|
Sign up to set email alerts
|

Intelligent Fuzzy Q-Learning Control of Humanoid Robots

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2012
2012
2015
2015

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 7 publications
0
1
0
Order By: Relevance
“…The reward evaluation can be defined by scaled values as "-1" for failure, "+1" for success and "0" for neutral. Moreover, continuous functions and fuzzy evaluation can also be adopted for the reward function as well [131,132]. If an action is followed by a satisfactory state of affairs or an improvement in the state of affairs, the tendency to produce that action is strengthened or reinforced (rewarded).…”
Section: Basic Elements Of Reinforcement Learning Systemmentioning
confidence: 99%
“…The reward evaluation can be defined by scaled values as "-1" for failure, "+1" for success and "0" for neutral. Moreover, continuous functions and fuzzy evaluation can also be adopted for the reward function as well [131,132]. If an action is followed by a satisfactory state of affairs or an improvement in the state of affairs, the tendency to produce that action is strengthened or reinforced (rewarded).…”
Section: Basic Elements Of Reinforcement Learning Systemmentioning
confidence: 99%