Olivier Tieleman scite author profile

Olivier Tieleman

5Publications

124Citation Statements Received

78Citation Statements Given

How they've been cited

122

How they cite others

Affiliations

Publications

Order By: Most citations

Multi-agent Communication meets Natural Language: Synergies between Functional and Structural Language Learning

Lazaridou¹,

Potapenko²,

Tieleman³

2020

View full text Add to dashboard Cite

We present a method for combining multiagent communication and traditional datadriven approaches to natural language learning, with an end goal of teaching agents to communicate with humans in natural language. Our starting point is a language model that has been trained on generic, not task-specific language data. We then place this model in a multi-agent self-play environment that generates task-specific rewards used to adapt or modulate the model, turning it into a taskconditional language model. We introduce a new way for combining the two types of learning based on the idea of reranking language model samples, and show that this method outperforms others in communicating with humans in a visual referential communication task. Finally, we present a taxonomy of different types of language drift that can occur alongside a set of measures to detect them.

show abstract

Never Give Up: Learning Directed Exploration Strategies

Badia¹,

Sprechmann²,

Vitvitskyi³

et al. 2020

Preprint

View full text Add to dashboard Cite

Multi-agent Communication meets Natural Language: Synergies between Functional and Structural Language Learning

Lazaridou¹,

Potapenko²,

Tieleman³

2020

Preprint

View full text Add to dashboard Cite

show abstract

Large-Scale Retrieval for Reinforcement Learning

Humphreys¹,

Guez²,

Tieleman³

et al. 2022

Preprint

View full text Add to dashboard Cite

Effective decision making involves flexibly relating past experiences and relevant contextual information to a novel situation. In deep reinforcement learning, the dominant paradigm is for an agent to amortise information that helps decisionmaking into its network weights via gradient descent on training losses. Here, we pursue an alternative approach in which agents can utilise large-scale contextsensitive database lookups to support their parametric computations. This allows agents to directly learn in an end-to-end manner to utilise relevant information to inform their outputs. In addition, new information can be attended to by the agent, without retraining, by simply augmenting the retrieval dataset. We study this approach in Go, a challenging game for which the vast combinatorial state space privileges generalisation over direct matching to past experiences. We leverage fast, approximate nearest neighbor techniques in order to retrieve relevant data from a set of tens of millions of expert demonstration states. Attending to this information provides a significant boost to prediction accuracy and game-play performance over simply using these demonstrations as training trajectories, providing a compelling demonstration of the value of large-scale retrieval in reinforcement learning agents.

show abstract

Formalising Concepts as Grounded Abstractions

Clark¹,

Lerchner²,

Glehn³

et al. 2021

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.