Estudos Teórico-Metodológicos Nas Ciências Exatas, Tecnológicas E Da Terra 2 2020
DOI: 10.22533/at.ed.5172010085

Actor-Critic Reinforcement Learning to Traction Control of an Electrical Vehicle

Abstract: Rights to this edition have been assigned to Atena Editora by the authors. All content of this book is licensed under a Creative Commons Attribution License, Attribution 4.0 International (CC BY 4.0). The content of the articles and their data, in their form, correctness, and reliability, are the sole responsibility of the authors and do not necessarily represent the official position of Atena Editora. Downloading and sharing the work is permitted provided that credit is given to the authors, but without the possib…

citations
Cited by 1 publication
(7 citation statements)
references
References 0 publications
“…For the proposed controller, which avoids inadequate slip of the wheels, the predicted future rewards do not have a significant influence on the current action. Furthermore, the evaluations of the discount factor reported by Funk Drechsler et al. [1] indicate better behavior of myopic training processes for this specific implementation.…”
Section: Value Function Suppression
Citation type: mentioning
confidence: 86%
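
The remark about myopic training can be illustrated with the standard discounted return, G = r_0 + γ·r_1 + γ²·r_2 + …. The sketch below is only an illustration under stated assumptions: the reward values and the function name `discounted_return` are hypothetical and do not come from the cited work. It shows that with a discount factor of zero the return reduces to the immediate reward, so predicted future rewards no longer affect the current action.

```python
def discounted_return(rewards, gamma):
    """Sum of rewards weighted by gamma**t, accumulated from the last step backwards."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g


# Hypothetical reward sequence, chosen only for illustration.
rewards = [1.0, -2.0, 0.5]

myopic = discounted_return(rewards, gamma=0.0)       # only the first reward counts
farsighted = discounted_return(rewards, gamma=0.9)   # later rewards also contribute

print(myopic, farsighted)
```

With `gamma=0.0` the result equals `rewards[0]`; any larger discount factor mixes in the later rewards.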
“…The implemented critic is composed of a network with two hidden layers and twenty nodes in each layer, while the actor has the same number of hidden layers and just twelve nodes in each layer. The network architecture, including input and output data, is the same as that applied by Funk Drechsler et al. [1]. The critic and actor networks use the Tangent-Sigmoid activation function in the hidden layers and a linear function in the output layer.…”
Section: Methods
Citation type: mentioning
confidence: 99%
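
The layer sizes quoted above can be sketched as two small fully connected networks. This is a minimal sketch, not the authors' implementation. The excerpt does not state the input and output dimensions, so `N_STATE` and `N_ACTION` are hypothetical, and the assumption that the critic takes only the state as input is also ours. The tangent-sigmoid function is mathematically the hyperbolic tangent, so `np.tanh` is used for the hidden layers.

```python
import numpy as np


def init_mlp(sizes, rng):
    """Create (weights, bias) pairs for a fully connected network with the given layer sizes."""
    return [
        (rng.standard_normal((n_in, n_out)) * 0.1, np.zeros(n_out))
        for n_in, n_out in zip(sizes[:-1], sizes[1:])
    ]


def forward(params, x):
    """Apply tanh on every hidden layer and a linear (identity) output layer."""
    for w, b in params[:-1]:
        x = np.tanh(x @ w + b)
    w, b = params[-1]
    return x @ w + b


rng = np.random.default_rng(0)

# Hypothetical dimensions: the excerpt does not give the input/output sizes.
N_STATE, N_ACTION = 4, 1

# Critic: two hidden layers of 20 nodes (assumed here to map state -> scalar value).
critic = init_mlp([N_STATE, 20, 20, 1], rng)
# Actor: two hidden layers of 12 nodes, mapping state -> action.
actor = init_mlp([N_STATE, 12, 12, N_ACTION], rng)

state = rng.standard_normal(N_STATE)
action = forward(actor, state)
value = forward(critic, state)
print(action.shape, value.shape)
```

Because the output layers are linear, neither the action nor the value estimate is bounded by the network itself; any actuator limits would have to be applied separately.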