SICE Annual Conference 2007
DOI: 10.1109/sice.2007.4421459

Proposal and evaluation of the penalty avoiding rational policy making algorithm with penalty level

Abstract: Reinforcement learning (RL) is a kind of machine learning that aims to adapt an agent to a given environment by means of rewards and penalties. The Penalty Avoiding Rational Policy Making algorithm (PARP) [5] and Penalty Avoiding Profit Sharing (PAPS) [6] are examples of RL systems that can suppress a penalty and learn a rational policy. However, they cannot handle multiple kinds of penalties. In this paper, we extend PARP/PAPS to environments with several kinds of penalties. We propose th…

Cited by 0 publications
References 17 publications