2009 IEEE Symposium on Industrial Electronics &Amp; Applications 2009
DOI: 10.1109/isiea.2009.5356361
|View full text |Cite
|
Sign up to set email alerts
|

Utilization of Webots and Khepera II as a platform for Neural Q-Learning controllers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
9
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(9 citation statements)
references
References 6 publications
0
9
0
Order By: Relevance
“…Pandey et al [21], designed sensor integration based fuzzy logic controller to create a collision-free path, using webots simulator to check simulation result. In [22] [23], webots and Khepera II miniature mobile robot utilized as a platform for the investigation of Neurol Q-Learning controller. Almasri et al [24] present a collision-free mobile robot navigation using fuzzy logic fusion model.…”
Section: Related Workmentioning
confidence: 99%
“…Pandey et al [21], designed sensor integration based fuzzy logic controller to create a collision-free path, using webots simulator to check simulation result. In [22] [23], webots and Khepera II miniature mobile robot utilized as a platform for the investigation of Neurol Q-Learning controller. Almasri et al [24] present a collision-free mobile robot navigation using fuzzy logic fusion model.…”
Section: Related Workmentioning
confidence: 99%
“…However, Neural Networks (NN), one of the most common form of function approximators, has not been extensively used in conjunction with RL. Although some successful applications have been reported [2] [3][4] [5], most of them choose sequential learning method for an obvious reason. Because of the on-line nature of RL, experience samples come in sequentially and thus incremental learning NN becomes an apparent choice.…”
Section: Introductionmentioning
confidence: 99%
“…It can be randomly initialized and then the weights of NN be tuned during interaction with the environment [3], but it may require a long time before NN can converge to an appropriate form. Another way is to first use tabular method to derive the training data and train a preliminary form of NN.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Robot path planning has been an active research area, and many methods have been developed to tackle this problem, such as A* algorithm [1]- [2], D* algorithm [3]- [4], reinforcement learning [5]- [6], Q-Learning [7]- [10], global Cspace methods [11], potential field methods [12], fuzzy logic [13]- [15] and neural networks [16]- [17]. Each method has its own strength over others in certain aspects.…”
Section: Introductionmentioning
confidence: 99%