Jelena Luketina scite author profile

To be successful in real-world tasks, Reinforcement Learning (RL) needs to exploit the compositional, relational, and hierarchical structure of the world, and learn to transfer it to the task at hand. Recent advances in representation learning for language make it possible to build models that acquire world knowledge from text corpora and integrate this knowledge into downstream decision making problems. We thus argue that the time is right to investigate a tight integration of natural language understanding into RL in particular. We survey the state of the field, including work on instruction following, text games, and learning from textual domain knowledge. Finally, we call for the development of new environments as well as further investigation into the potential uses of recent Natural Language Processing (NLP) techniques for such tasks.

show abstract

Progress & Compress: A scalable framework for continual learning

Schwarz¹,

Luketina²,

Czarnecki³

et al. 2018

Preprint

View full text Add to dashboard Cite

A Survey of Reinforcement Learning Informed by Natural Language

Luketina¹,

Nardelli²,

Farquhar³

et al. 2019

Preprint

View full text Add to dashboard Cite

Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning

Igl¹,

Farquhar²,

Luketina³

et al. 2020

Preprint

View full text Add to dashboard Cite

Non-stationarity arises in Reinforcement Learning (RL) even in stationary environments. Most RL algorithms collect new data throughout training, using a non-stationary behaviour policy. Furthermore, training targets in RL can change even with a fixed state distribution when the policy, critic, or bootstrap values are updated. We study these types of non-stationarity in supervised learning settings as well as in RL, finding that they can lead to worse generalisation performance when using deep neural network function approximators. Consequently, to improve generalisation of deep RL agents, we propose Iterated Relearning (ITER). ITER augments standard RL training by repeated knowledge transfer of the current policy into a freshly initialised network, which thereby experiences less non-stationarity during training. Experimentally, we show that ITER improves performance on the challenging generalisation benchmarks ProcGen and Multiroom.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jelena Luketina

A Survey of Reinforcement Learning Informed by Natural Language

Progress & Compress: A scalable framework for continual learning

A Survey of Reinforcement Learning Informed by Natural Language

Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning

Contact Info

Product

Resources

About