2017 IEEE 15th International Conference on Industrial Informatics (INDIN) 2017
DOI: 10.1109/indin.2017.8104770
|View full text |Cite
|
Sign up to set email alerts
|

An application of reinforcement learning algorithms to industrial multi-robot stations for cooperative handling operation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(3 citation statements)
references
References 12 publications
0
3
0
Order By: Relevance
“…e application of manipulators makes production more efficient and flexible. e core issue of automatic manipulator tracking control is how to ensure the given moving target follows the expected trajectory and adapts to various uncertain factors [7][8][9][10][11]. It is of great practical significance to derive an automatic tracking control strategy for moving targets under uncertainties and external interference.…”
Section: Introductionmentioning
confidence: 99%
“…e application of manipulators makes production more efficient and flexible. e core issue of automatic manipulator tracking control is how to ensure the given moving target follows the expected trajectory and adapts to various uncertain factors [7][8][9][10][11]. It is of great practical significance to derive an automatic tracking control strategy for moving targets under uncertainties and external interference.…”
Section: Introductionmentioning
confidence: 99%
“…Reinforcement learning applied to multi-agent systems has two dimensions: DRL algorithms that model policies for multiagent control and interaction, and DRL approaches that rely on multiple agents to parallelize the learning process or explore a wider variety of experiences. Within the former category, we can find examples of DRL for formation control [6], obstacle and collision avoidance [7], [8], collaborative assembly [9], or cooperative multi-agent control in general [10]. In the latter category, most existing approaches refer to the utilization of multiple agents to learn in parallel, but from the point of view of a multi-process or multi-threaded application [3].…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, the RL algorithm is used to optimise the weighting parameters using the method of mutual learning between weight parameters and DMR. RL has been applied to electric vehicle dispatching [21], transportation planning [22], power systems [23], robot control [24,25], and other problems. It has achieved impressive results in the fields of unmanned cars [26] and games [27,28].…”
Section: Introductionmentioning
confidence: 99%