2022
DOI: 10.7557/18.6237
|View full text |Cite
|
Sign up to set email alerts
|

Expert Q-learning: Deep Reinforcement Learning with Coarse State Values from Offline Expert Examples

Abstract: In this article, we propose a novel algorithm for deep reinforcement learning named Expert Q-learning. Expert Q-learning is inspired by Dueling Q-learning and aims to incorporate semi-supervised learning into reinforcement learning through splitting Q-values into state values and action advantages. We require that an offline expert assesses the value of a state in a coarse manner using three discrete values. An expert network is designed in addition to the Q-network, which updates each time following the regul… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 16 publications
0
4
0
Order By: Relevance
“…It is noteworthy that the aggregation of the key does not invariably require a linear output layer. A Minkowski sum of embeddings has been demonstrated as an alternative approach that can competently generate effective aggregated representations (Meng et al, 2023). Building on this insight, we have crafted an additional aggregation strategy that employs a straightforward averaging operation of K with H charts, shown in Eq.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…It is noteworthy that the aggregation of the key does not invariably require a linear output layer. A Minkowski sum of embeddings has been demonstrated as an alternative approach that can competently generate effective aggregated representations (Meng et al, 2023). Building on this insight, we have crafted an additional aggregation strategy that employs a straightforward averaging operation of K with H charts, shown in Eq.…”
Section: Methodsmentioning
confidence: 99%
“…modules. Our approach is inspired by the unbalanced atlas (UA) paradigm (Korman, 2021;Meng et al, 2023) from the area of self-supervised learning (SSL). In our methodology, the key's weights have been uncoupled from the weights of the query and value, which are instead characterized as the charts of a manifold.…”
Section: Introductionmentioning
confidence: 99%
“…Another application utilizes the transformer model in RL as the replacement of convolutional layers for feature extraction. This is the case of the Swin Transformer model used for image processing [12]. It differs from the present paper, which does not incorporate the entire game screen as part of its input.…”
Section: Introductionmentioning
confidence: 92%
“…Regarding interactive modeling concepts, Deep Reinforcement Learning (DRL) [18] offers a novel approach to knowledge inference. The Deeppath model, which is widely used, considers the entities of knowledge as the state spaces and navigates between them by selecting relations.…”
Section: Kg Reasoning On Neural Networkmentioning
confidence: 99%