2022
DOI: 10.1016/j.neunet.2022.03.037
|View full text |Cite
|
Sign up to set email alerts
|

Deep learning, reinforcement learning, and world models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
90
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 223 publications
(91 citation statements)
references
References 51 publications
0
90
0
1
Order By: Relevance
“…e essence of oral English pronunciation recognition based on machine learning [12][13][14] is to classify and recognize each word in the input audio. e essence of oral English pronunciation recognition based on deep learning [15][16][17] is the same. e difference is that the process is different.…”
Section: Introductionmentioning
confidence: 99%
“…e essence of oral English pronunciation recognition based on machine learning [12][13][14] is to classify and recognize each word in the input audio. e essence of oral English pronunciation recognition based on deep learning [15][16][17] is the same. e difference is that the process is different.…”
Section: Introductionmentioning
confidence: 99%
“…Nevertheless, because of the shortcoming that it is very sensitive to its initialization process and training process and easy to overfit (Rothmann and Porrmann, 2022;Matsuo et al, 2022), reinforcement learning should be improved further around the issue of this article. In this respect, the random forest algorithm could be introduced to solve the overfitting problem (Ouadah et al, 2022;Fei et al, 2022).…”
Section: Methods Proposition 41 Algorithm Outlinementioning
confidence: 99%
“…Nevertheless, because of the shortcoming that it is very sensitive to its initialization process and training process and easy to overfit (Rothmann and Porrmann, 2022; Matsuo et al. , 2022), reinforcement learning should be improved further around the issue of this article.…”
Section: Methods Propositionmentioning
confidence: 99%
“…Another negative penalty, proportional to the absolute difference between the current and previous actions (as indicated in (7)) is inflicted on the agent to discourage commanding largely different actions, particularly in opposing directions, which results in an oscillatory behavior of the physical platform.…”
Section: E Reward Designmentioning
confidence: 99%
“…Remarkable progress has been registered in developing RL algorithms for various robotic applications including, but not limited to, manipulation [3], navigation [4], [5], and control [6]. Reinforcement learning (RL) is a machine learning paradigm that relies on a rewarding system to train a set of neural networks to make sequential decisions to execute a particular task [7]. More specifically, an RL agent first receives a set of observations parameterizing the state of the environment in which the task is being carried out.…”
Section: Introductionmentioning
confidence: 99%