2015
DOI: 10.1109/tcst.2014.2312392
|View full text |Cite
|
Sign up to set email alerts
|

Multirobot Cooperative Learning for Predator Avoidance

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
69
0
1

Year Published

2016
2016
2020
2020

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 128 publications
(70 citation statements)
references
References 32 publications
0
69
0
1
Order By: Relevance
“…To ensure a stable training, in our experiemnt we use a purely exploratory policy for the first 10000 time steps then begin to train the networks. (d) The training of the critic network involves the computation of Q-value error δ j and the gradient ∇ ω q(s j , u j |ω) according to (18) and (23). This is easy to implement using the back propagation algorithms.…”
Section: A Implementation Detailsmentioning
confidence: 99%
See 1 more Smart Citation
“…To ensure a stable training, in our experiemnt we use a purely exploratory policy for the first 10000 time steps then begin to train the networks. (d) The training of the critic network involves the computation of Q-value error δ j and the gradient ∇ ω q(s j , u j |ω) according to (18) and (23). This is easy to implement using the back propagation algorithms.…”
Section: A Implementation Detailsmentioning
confidence: 99%
“…Under the dynamic programming framework, the RL aims to solve Markov decision process (MDP) problems by solely using training samples of the decision process. It is a model-free method, and has been applied successfully for the robotic control, including the path planning for a mobile robot [15] or multirobot [16], the vision-based robotic motion control [17], the multirobot cooperation [18], the high-precision motion control [19], the autonomous underwater vehicle (AUV) [20], etc.…”
Section: Introductionmentioning
confidence: 99%
“…Multi-robot systems often require forms of cooperation, collaboration or co-ordination among the entities, such as collision-avoidance (Remmersmann et al, 2010) or flocking control (La et al, 2015), and for those inter-robot interaction is needed. Robots may possess various levels of cognition; see for example (Celentano, 2014).…”
Section: Control Of Cognitive Multi-robot Systemsmentioning
confidence: 99%
“…Q-learning methods have been applied on a variety of tasks by autonomous robots [1], and much research has been done in this field starting many years ago [2], with some work specific to continuous action spaces [3]- [6] and others on discrete action spaces [7]. Reinforcement Learning (RL) has been applied to locomotion [8] [9] and also to manipulation [10], [11].…”
Section: Introductionmentioning
confidence: 99%