2019
DOI: 10.1109/thms.2019.2912447
|View full text |Cite
|
Sign up to set email alerts
|

Human-Centered Reinforcement Learning: A Survey

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
40
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 94 publications
(40 citation statements)
references
References 46 publications
0
40
0
Order By: Relevance
“…Integrating evaluative feedback from humans into RL is sometimes called human-centered RL. A survey on humancentered RL topic is provided by [37]. One important model to mention is TAMER [38], [39].…”
Section: Related Workmentioning
confidence: 99%
“…Integrating evaluative feedback from humans into RL is sometimes called human-centered RL. A survey on humancentered RL topic is provided by [37]. One important model to mention is TAMER [38], [39].…”
Section: Related Workmentioning
confidence: 99%
“…In the first one, called reward-shaping, the trainer modifies or accepts the reward given by the environment in order to bias the agent's learning [19,20]. In the second one, called policy-shaping, the trainer may suggest a different action to perform, by replacing the one proposes by the policy [21,22]. A simple policy-shaping method involves forcing the agent to take certain actions that are recommended by the trainer [23,24].…”
Section: Deep Reinforcement Learning and Interactive Feedbackmentioning
confidence: 99%
“…An inefficient reward function means that the controller needs a lot of learning samples and time to test and explore, which seriously limits the application of the traditional reinforcement learning method to the actual robot system. Therefore, on the basis of traditional reinforcement learning, researchers put forward interactive reinforcement learning [7], [14], [16], [17], [19], [20], [28]- [30]. In interactive reinforcement learning, an agent learns in an MDP without reward function.…”
Section: A Deep Q-networkmentioning
confidence: 99%
“…In order to speed up the robot learning, researchers propose interactive reinforcement learning (IRL) [7] based on reward shaping [8] in traditional reinforcement learning methods. Interactive reinforcement learning allows designers and even non-technical personnel to train robots by evaluating their behavior.…”
Section: Introductionmentioning
confidence: 99%