2024
DOI: 10.1613/jair.1.15348
|View full text |Cite
|
Sign up to set email alerts
|

Human-in-the-Loop Reinforcement Learning: A Survey and Position on Requirements, Challenges, and Opportunities

Carl Orge Retzlaff,
Srijita Das,
Christabel Wayllace
et al.

Abstract: Artificial intelligence (AI) and especially reinforcement learning (RL) have the potential to enable agents to learn and perform tasks autonomously with superhuman performance. However, we consider RL as fundamentally a Human-in-the-Loop (HITL) paradigm, even when an agent eventually performs its task autonomously.  In cases where the reward function is challenging or impossible to define, HITL approaches are considered particularly advantageous. The application of Reinforcement Learning from Human Feedback (R… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 25 publications
(2 citation statements)
references
References 120 publications
0
2
0
Order By: Relevance
“…This must go hand in hand with the development of a comprehensive knowledge graph, which holds significant potential for advancing our understanding of the causal relationships underlying forest accidents. The incorporation of the human-in-the-loop approach [55] promises to enhance the relevance and applicability of AI solutions in this context. In future research, we aim to integrate counterfactual explanations [56] for analyzing forestry accidents.…”
Section: Discussionmentioning
confidence: 99%
“…This must go hand in hand with the development of a comprehensive knowledge graph, which holds significant potential for advancing our understanding of the causal relationships underlying forest accidents. The incorporation of the human-in-the-loop approach [55] promises to enhance the relevance and applicability of AI solutions in this context. In future research, we aim to integrate counterfactual explanations [56] for analyzing forestry accidents.…”
Section: Discussionmentioning
confidence: 99%
“…Furthermore, the concept of HITL is used to denote human participation in the process of aiding the computer to make accurate decisions during model development [24]. HITL enhances the ML process compared to random sampling by selecting the most important data to achieve the best results [25,26].…”
Section: Introductionmentioning
confidence: 99%