The 2021 Conference on Artificial Life (ALIFE 2021)
DOI: 10.1162/isal_a_00449
Safer Reinforcement Learning through Transferable Instinct Networks

Abstract: Random exploration is one of the main mechanisms through which reinforcement learning (RL) finds well-performing policies. However, it can lead to undesirable or catastrophic outcomes when learning online in safety-critical environments. In fact, safe learning is one of the major obstacles towards real-world agents that can learn during deployment. One way of ensuring that agents respect hard limitations is to explicitly configure boundaries in which they can operate. While this might work in some cases, we do…
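The abstract mentions explicitly configured boundaries as one way to enforce hard limits on an agent. A minimal sketch of that idea is an action-clamping wrapper placed between the policy and the environment; the names here (`SafeActionWrapper`, the toy `env_step`) are illustrative assumptions, not part of the paper's method.

```python
# Hypothetical sketch: a hard safety boundary around an RL agent's actions.
# The class and names are illustrative, not taken from the cited paper.

class SafeActionWrapper:
    """Clamp proposed actions to a configured safe interval before execution."""

    def __init__(self, env_step, low, high):
        self.env_step = env_step  # callable: action -> (observation, reward)
        self.low = low
        self.high = high

    def step(self, action):
        # Clip the proposed action into the safe operating range,
        # so the environment never receives an out-of-bounds command.
        safe_action = max(self.low, min(self.high, action))
        return self.env_step(safe_action)


# Usage with a toy environment whose reward echoes the executed action.
wrapper = SafeActionWrapper(lambda a: (None, a), low=-1.0, high=1.0)
_, reward = wrapper.step(5.0)  # proposed action exceeds the upper bound
print(reward)  # 1.0 (clipped to the boundary)
```

Such static clipping is exactly the kind of fixed boundary the abstract argues works only "in some cases"; the paper's instinct-network approach instead learns and transfers the safety behavior.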

Cited by 4 publications (1 citation statement)
References 27 publications (35 reference statements)
“…Importantly, STELLAR integrated 11 innovative components that solve different challenges and requirements for LL. It employed Sliced Cramer Preservation (SCP) (Kolouri et al, 2020), or the sketched version of it (SCP++) (Li et al, 2021), and Complex Synapse Optimizer (Benna and Fusi, 2016) to overcome catastrophic forgetting of old tasks; Self-Preserving World Model (Ketz et al, 2019) and Context-Skill Model (Tutum et al, 2021) for backward transfer to old tasks as well as forward transfer to their variants; Neuromodulated Attention (Zou et al, 2020) for rapid performance recovery when an old task repeats; Modulated Hebbian Network (Ladosz et al, 2022) and Plastic Neuromodulated Network (Ben-Iwhiwhu et al, 2021) for rapid adaptation to new tasks; Reflexive Adaptation (Maguire et al, 2021) and Meta-Learned Instinct Network (Grbic and Risi, 2021) to safely adapt to new tasks; and Probabilistic Program Neurogenesis (Martin and Pilly, 2019) to scale up the learning of new tasks during fielded operation. More details on the precise effect of each of these components are beyond the scope of this paper; however, this case study outlines how the integrated system dynamics demonstrated LL using the proposed metrics, and how these metrics shaped the advancement of the SG-HRL system.…”
Section: System Group HRL - CARLA, 5.3.1 System Overview (mentioning)
Confidence: 99%