2011
DOI: 10.1080/01969722.2011.634681
|View full text |Cite
|
Sign up to set email alerts
|

Learning to Avoid Risky Actions

Abstract: When a reinforcement learning agent executes actions that can cause frequently damages to itself, it can learn, by using Q-learning, that these actions must not be executed again. However, there are other actions that do not cause damage frecuently, only once in a while: risky actions, such as parachuting. These actions may imply a big punishment to the agent and, depending on its personality, it would be better to avoid. Nevertheless, using the standard Q-learning algorithm the agent is not able to learn to a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2013
2013
2017
2017

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…Maggie is a social and personal robot intended to perform research on HRI and improving robots autonomy (Figure 1). It is controlled by the Automatic-Deliberative architecture (Barber and Salichs 2002;Barber 2000;Barber and Salichs 2001;Rivas, Corrales, Barber, and Salichs 2007;Malfaz and Salichs 2011) where the elemental component is the skill. Skills endow the robot with different sensory and motor capacities, and process information.…”
Section: The Robot Maggie and Its Decision Making Systemmentioning
confidence: 99%
See 1 more Smart Citation
“…Maggie is a social and personal robot intended to perform research on HRI and improving robots autonomy (Figure 1). It is controlled by the Automatic-Deliberative architecture (Barber and Salichs 2002;Barber 2000;Barber and Salichs 2001;Rivas, Corrales, Barber, and Salichs 2007;Malfaz and Salichs 2011) where the elemental component is the skill. Skills endow the robot with different sensory and motor capacities, and process information.…”
Section: The Robot Maggie and Its Decision Making Systemmentioning
confidence: 99%
“…In this approach, the external state considers each object separately (Castro-González, Malfaz, and Salichs 2011).…”
Section: A the Reduced State Spacementioning
confidence: 99%
“…Our work is also related to safe reinforcement learning (García and Fernández 2015), which also aims to identify risky states. Many such methods aim to optimise a riskaverse objective (Bertsekas and Rhodes 1971;Heger 1994;Malfaz and Salichs 2011), whereas we aim to more efficiently optimise a risk-neutral objective (expected return). Other methods aim to constrain exploration to avoid risky states (Gehring and Precup 2013), whereas we learn in a safe simulator and thus seek proposal distributions that visit such states more often, if they are significant to expected return.…”
Section: Related Workmentioning
confidence: 99%
“…you are afraid of walking a tightrope). Risky actions have already been studied in virtual agents [35] and they will be considered in our robot in future works.…”
Section: General Aspects Of Fearmentioning
confidence: 99%