2022
DOI: 10.1007/s00521-021-06850-6
|View full text |Cite
|
Sign up to set email alerts
|

Human engagement providing evaluative and informative advice for interactive reinforcement learning

Abstract: Interactive reinforcement learning proposes the use of externally sourced information in order to speed up the learning process. When interacting with a learner agent, humans may provide either evaluative or informative advice. Prior research has focused on the effect of human-sourced advice by including real-time feedback on the interactive reinforcement learning process, specifically aiming to improve the learning speed of the agent, while minimising the time demands on the human. This work focuses on answer… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(2 citation statements)
references
References 43 publications
0
2
0
Order By: Relevance
“…Interactive reinforcement learning (IntRL) is an intriguing domain integrating a trainer to guide or evaluate the behavior of a learning agent [32,33]. The trainer's advice reinforces the agent's learning method and molds the exploration strategy, resulting in sample-efficient training by reducing the search spaces of states and actions [34]. Millan-Arias et al [35] developed an actor-critic RL agent that trained in a disturbed environment while receiving advice, allowing the agent to learn more robust policies.…”
Section: Related Workmentioning
confidence: 99%
“…Interactive reinforcement learning (IntRL) is an intriguing domain integrating a trainer to guide or evaluate the behavior of a learning agent [32,33]. The trainer's advice reinforces the agent's learning method and molds the exploration strategy, resulting in sample-efficient training by reducing the search spaces of states and actions [34]. Millan-Arias et al [35] developed an actor-critic RL agent that trained in a disturbed environment while receiving advice, allowing the agent to learn more robust policies.…”
Section: Related Workmentioning
confidence: 99%
“…To solve the data querying problem, RL methods are still required. Methods without using IRL, but still infusing human supervision in RL setting, are proposed in [ 36 , 37 ]. Here, instead of obtaining the expert knowledge beforehand, a human-in-the-loop approach is used to query an expert for advice on action and reward.…”
Section: Related Workmentioning
confidence: 99%