2009
DOI: 10.1007/s10489-009-0191-x
|View full text |Cite
|
Sign up to set email alerts
|

Microassembly path planning using reinforcement learning for improving positioning accuracy of a 1 cm3 omni-directional mobile microrobot

Abstract: This paper introduces the path planning of a 1 cm 3 mobile microrobot that is designed for microassembly in a microfactory. Since the conventional path planning method can not achieve high microassembly positioning accuracy, a supervised learning assisted reinforcement learning (SL-RL) method has been developed. In this mixed learning method, the reinforcement learning (RL) is used to search a movement path in the normal learning area. But when the microrobot moves into the buffer area, the supervised learning… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
11
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(11 citation statements)
references
References 27 publications
0
11
0
Order By: Relevance
“…We will justify this with the following experimental results that are based on the benchmark environments shown in Table 2. Table 3 shows the comparison between the t 0 calculated from (18) and the average number of iterations needed for the LA to converge, in practice. As the CPA and ACPA are well-established algorithms, we know that numerous experiments have been conducted to confirm their validity, and so we merely use the figures from [30] to record the practical results.…”
Section: Remarkmentioning
confidence: 99%
See 1 more Smart Citation
“…We will justify this with the following experimental results that are based on the benchmark environments shown in Table 2. Table 3 shows the comparison between the t 0 calculated from (18) and the average number of iterations needed for the LA to converge, in practice. As the CPA and ACPA are well-established algorithms, we know that numerous experiments have been conducted to confirm their validity, and so we merely use the figures from [30] to record the practical results.…”
Section: Remarkmentioning
confidence: 99%
“…LA have also been used in natural language processing, string taxonomy [12], graph patitioning [13], map learning [14], service selection in stochastic environments [15], numerical optimization [16], web crawling [17], microassembly path planning [18], multiagent learning [19], and in batch sequencing and sizing in just-in-time manufacturing systems [20].…”
mentioning
confidence: 99%
“…They have been used in game playing [4][5][6][7][8][9][10], parameter optimization [11,12], channel selection in cognitive radio networks [13], assigning capacities in prioritized networks [14], solving knapsack problems [15], optimizing the web polling problem [16,17], stochastically optimally allocating limited resources [15,18,19], service selection in stochastic environments [20], numerical optimization [21], web crawling [22], microassembly path planning [23], multiagent learning [24], and in batch sequencing and sizing in just-intime manufacturing systems [25]. An asynchronous actionreward learning has been used for nonstationary serial supply chain inventory control [26].…”
Section: Learning Automata: Concept and Their Applicationsmentioning
confidence: 99%
“…The RL method is also frequently applied in various engineering tasks. It is used in nonstationary serial supply chain inventory control [18], adaptive control of nonlinear objects [43], adjusting robot behavior for autonomous navigation system [26] or path planning for improving positioning accuracy of a mobile microrobot [22]. There are also studies which propose the use of RL in non-technical domains, e.g.…”
Section: Introductionmentioning
confidence: 99%