2018
DOI: 10.1016/j.robot.2017.11.010
|View full text |Cite
|
Sign up to set email alerts
|

Reset-free Trial-and-Error Learning for Robot Damage Recovery

Abstract: The high probability of hardware failures prevents many advanced robots (e.g., legged robots) from being confidently deployed in real-world situations (e.g., post-disaster rescue). Instead of attempting to diagnose the failures, robots could adapt by trial-and-error in order to be able to complete their tasks. In this situation, damage recovery can be seen as a Reinforcement Learning (RL) problem. However, the best RL algorithms for robotics require the robot and the environment to be reset to an initial state… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
87
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 94 publications
(87 citation statements)
references
References 58 publications
0
87
0
Order By: Relevance
“…Algorithms based on behavior performance maps [Chatzilygeroudis et al 2018;Cully et al 2015] rely on the assumption that knowledge of the cause of damage i.e., a proper diagnosis report is not necessary to recover from the damage. Rather than considering two separate phases for damage diagnosis and recovery algorithm generation, Cully et al [Cully et al 2015], proposed a method inspired from animals, who perform trial and error to determine the least painful alternate gait in the presence of injury.…”
Section: Map-based Algorithms For Adaptationmentioning
confidence: 99%
See 1 more Smart Citation
“…Algorithms based on behavior performance maps [Chatzilygeroudis et al 2018;Cully et al 2015] rely on the assumption that knowledge of the cause of damage i.e., a proper diagnosis report is not necessary to recover from the damage. Rather than considering two separate phases for damage diagnosis and recovery algorithm generation, Cully et al [Cully et al 2015], proposed a method inspired from animals, who perform trial and error to determine the least painful alternate gait in the presence of injury.…”
Section: Map-based Algorithms For Adaptationmentioning
confidence: 99%
“…Deep Reinforcement learning (Deep RL) has been shown to be effective in modeling such navigation problems because of both its online and offline learning capabilities in high dimensional search spaces [Chatzilygeroudis et al 2018;Hwangbo et al 2017;Lobos-Tsunekawa et al 2018;Pinto et al 2017a]. In the context of adapting Authors' addresses: Shresth Verma, ABV-Indian Institute of Information Technology and Management, Gwalior, vermashresth@gmail.com; Haritha S. Nair, ABV-Indian Institute of Information Technology and Management, Gwalior, haritha1313@gmail.com; Gaurav Agarwal, ABV-Indian Institute of Information Technology and Management, Gwalior, gaurava05@gmail.com; Joydip Dhar, ABV-Indian Institute of Information Technology and Management, Gwalior, jdhar@iiitm.ac.in; Anupam Shukla, ABV-Indian Institute of Information Technology and Management, Gwalior, anupamshukla@iiitm.…”
Section: Introductionmentioning
confidence: 99%
“…In a game-theoretical setting, the FPCP can be posed as follows. (2). After this step, the game proceeds the same as Game 1.…”
Section: Game Formulationmentioning
confidence: 99%
“…An autonomous system operating for substantial periods of time in remote, unknown, or hostile environment will inevitably sustain damage or experience partial system failures over time due to malfunctions. Examples include unmanned aerial vehicles (UAVs) operating over contested territory [1], search-and-rescue robots [2], and rovers performing missions on extraterrestrial surfaces [3]. (b) Hostile takeover.…”
Section: Introductionmentioning
confidence: 99%
“…Indeed, much work has investigated how, in the absence of external supervision, a robot can automatically learn new ways to control its body when damaged [6,10,13,20,24,26,34]. While a diverse set of recovery mechanisms have been proposed, they all shared a common assumption: The damaged mechanical structure could be reconfigured, but not fundamentally deformed.…”
Section: Introductionmentioning
confidence: 99%