2017
DOI: 10.22146/jnteti.v6i1.287
|View full text |Cite
|
Sign up to set email alerts
|

Implementasi Q-Learning dan Backpropagation pada Agen yang Memainkan Permainan Flappy Bird

Abstract: Penggunaan value-function approximation diharapkan bisa mempercepat waktu pembelajaran dan mengurangi bobot yang disimpan, karena dari hasil penelitian sebelumnya dibutuhkan waktu yang lama dan banyaknya bobot yang disimpan di memori ketika hanya digunakan reinforcement learning saja. Arsitektur artificial neural network (ANN) yang digunakan adalah satu ANN pada masing-masing kemungkinan aksi. Berdasarkan hasil pengujian, diperoleh kesimpulan bahwa implementasi Q-learning yang dikombinasikan dengan backpropaga… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 2 publications
0
5
0
Order By: Relevance
“…The Qlearning with backpropagation required an average training time of 9 minutes and 1 second, while classical Q-learning took 120 minutes. Therefore, Q-learning with backpropagation was 92% faster than classical Q-learning with similar performance [14].…”
Section: Related Workmentioning
confidence: 93%
See 2 more Smart Citations
“…The Qlearning with backpropagation required an average training time of 9 minutes and 1 second, while classical Q-learning took 120 minutes. Therefore, Q-learning with backpropagation was 92% faster than classical Q-learning with similar performance [14].…”
Section: Related Workmentioning
confidence: 93%
“…In Equation 1, there is a learning rate (α) that usually takes values between 0 and 1. The learning rate parameter (α) signifies the rate of change from the old Q-value to be replaced by the new Q-value [14], [18]. A smaller learning rate implies a slower change in Q-value, indicating that the agent is cautious in updating Q-values.…”
Section: Q-learning Implementationmentioning
confidence: 99%
See 1 more Smart Citation
“…The Deep Q Network (DQN) is a reinforcement learning algorithm developed to overcome complex problems in machine learning [20]. This algorithm is a combination of reinforcement learning with a deep artificial neural network [21].…”
Section: Deep Q Network Algorithmmentioning
confidence: 99%
“…First is Feed-forward, which is the pattern training process that will set to each unit in the input layer, then output the one generated is transmitted to the next layer, continue until the output layer. Second is backpropagation, which is the process of adjusting each weight based on the expected output, to be produced minimal error, starting from the weight connected to the output neuron, then continue to retreat until to the input layer [10].…”
Section: Identification Of Varieties Using Backpropagationmentioning
confidence: 99%