Reset-free Trial-and-Error Learning for Robot Damage Recovery

Chatzilygeroudis, Konstantinos; Vassiliades, Vassilis; Mouret, Jean-Baptiste

doi:10.1016/j.robot.2017.11.010

Cited by 94 publications

(87 citation statements)

References 58 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Algorithms based on behavior performance maps [Chatzilygeroudis et al 2018;Cully et al 2015] rely on the assumption that knowledge of the cause of damage i.e., a proper diagnosis report is not necessary to recover from the damage. Rather than considering two separate phases for damage diagnosis and recovery algorithm generation, Cully et al [Cully et al 2015], proposed a method inspired from animals, who perform trial and error to determine the least painful alternate gait in the presence of injury.…”

Section: Map-based Algorithms For Adaptationmentioning

confidence: 99%

“…Deep Reinforcement learning (Deep RL) has been shown to be effective in modeling such navigation problems because of both its online and offline learning capabilities in high dimensional search spaces [Chatzilygeroudis et al 2018;Hwangbo et al 2017;Lobos-Tsunekawa et al 2018;Pinto et al 2017a]. In the context of adapting Authors' addresses: Shresth Verma, ABV-Indian Institute of Information Technology and Management, Gwalior, vermashresth@gmail.com; Haritha S. Nair, ABV-Indian Institute of Information Technology and Management, Gwalior, haritha1313@gmail.com; Gaurav Agarwal, ABV-Indian Institute of Information Technology and Management, Gwalior, gaurava05@gmail.com; Joydip Dhar, ABV-Indian Institute of Information Technology and Management, Gwalior, jdhar@iiitm.ac.in; Anupam Shukla, ABV-Indian Institute of Information Technology and Management, Gwalior, anupamshukla@iiitm.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots

Verma

Nair

Agarwal

et al. 2020

Proceedings of the 7th ACM IKDD CoDS and 25th COMAD

View full text Add to dashboard Cite

Robotics has proved to be an indispensable tool in many industrial as well as social applications, such as warehouse automation, manufacturing, disaster robotics, etc. In most of these scenarios, damage to the agent while accomplishing mission-critical tasks can result in failure. To enable robotic adaptation in such situations, the agent needs to adopt policies which are robust to a diverse set of damages and must do so with minimum computational complexity. We thus propose a damage aware control architecture which diagnoses the damage prior to gait selection while also incorporating domain randomization in the damage space for learning a robust policy. To implement damage awareness, we have used a Long Short Term Memory based supervised learning network which diagnoses the damage and predicts the type of damage. The main novelty of this approach is that only a single policy is trained to adapt against a wide variety of damages and the diagnosis is done in a single trial at the time of damage.

show abstract

Section: Map-based Algorithms For Adaptationmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots

Verma

Nair

Agarwal

et al. 2020

Proceedings of the 7th ACM IKDD CoDS and 25th COMAD

View full text Add to dashboard Cite

show abstract

“…In a game-theoretical setting, the FPCP can be posed as follows. (2). After this step, the game proceeds the same as Game 1.…”

Section: Game Formulationmentioning

confidence: 99%

“…An autonomous system operating for substantial periods of time in remote, unknown, or hostile environment will inevitably sustain damage or experience partial system failures over time due to malfunctions. Examples include unmanned aerial vehicles (UAVs) operating over contested territory [1], search-and-rescue robots [2], and rovers performing missions on extraterrestrial surfaces [3]. (b) Hostile takeover.…”

Section: Introductionmentioning

confidence: 99%

Graph-Based Controller Synthesis for Safety-Constrained, Resilient Systems

Bucić¹,

Ornik

Topcu

2018

2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

View full text Add to dashboard Cite

Resilience to damage, component degradation, and adversarial action is a critical consideration in design of autonomous systems. In addition to designing strategies that seek to prevent such negative events, it is vital that an autonomous system remains able to achieve its control objective even if the system partially loses control authority. While loss of authority limits the system's control capabilities, it may be possible to use the remaining authority in such a way that the system's control objectives remain achievable. In this paper, we consider the problem of optimal design for an autonomous system with discrete-time linear dynamics where the available control actions depend on adversarial input produced as a result of loss of authority. The central question is how to partition the set of control inputs that the system can apply in such a way that the system state remains within a safe set regardless of an adversarial input limiting the available control inputs to a single partition elements. We interpret such a problem first as a variant of a safety game, and then as a problem of existence of an appropriate edge labeling on a graph. We obtain conditions for existence and a computationally efficient algorithm for determining a system design and a control policy that preserve system safety. We illustrate our results on two examples: a damaged autonomous vehicle and a method of communication over a channel that ensures a minimal running digital sum.M. Bucić is with ETH Zürich. M. Ornik and U. Topcu are with the University of Texas at Austin.

show abstract

“…Indeed, much work has investigated how, in the absence of external supervision, a robot can automatically learn new ways to control its body when damaged [6,10,13,20,24,26,34]. While a diverse set of recovery mechanisms have been proposed, they all shared a common assumption: The damaged mechanical structure could be reconfigured, but not fundamentally deformed.…”

Section: Introductionmentioning

confidence: 99%

Automated Shapeshifting for Function Recovery in Damaged Robots

Kriegman¹,

Walker²,

Shah³

et al. 2019

Robotics: Science and Systems XV

View full text Add to dashboard Cite

Fig. 1. After learning to walk, a simulated quadruped is subjected to unanticipated insult: its legs are cut off. An evolutionary algorithm searches for deformations to the postdamage structure that, when coupled with the predamage controller, result in function recovery. One of the evolved solutions (shown here) yields the spontaneous "regeneration" of the lost legs, which was manually transferred to reality (youtu.be/afOXX2r54mQ).Abstract-A robot's mechanical parts routinely wear out from normal functioning and can be lost to injury. For autonomous robots operating in isolated or hostile environments, repair from a human operator is often not possible. Thus, much work has sought to automate damage recovery in robots. However, every case reported in the literature to date has accepted the damaged mechanical structure as fixed, and focused on learning new ways to control it. Here we show for the first time a robot that automatically recovers from unexpected damage by deforming its resting mechanical structure without changing its control policy. We found that, especially in the case of "deep insult", such as removal of all four of the robot's legs, the damaged machine evolves shape changes that not only recover the original level of function (locomotion) as before, but can in fact surpass the original level of performance (speed). This suggests that shape change, instead of control readaptation, may be a better method to recover function after damage in some cases.

show abstract

Reset-free Trial-and-Error Learning for Robot Damage Recovery

Cited by 94 publications

References 58 publications

Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots

Deep Reinforcement Learning for Single-Shot Diagnosis and Adaptation in Damaged Robots

Graph-Based Controller Synthesis for Safety-Constrained, Resilient Systems

Automated Shapeshifting for Function Recovery in Damaged Robots

Contact Info

Product

Resources

About