MEMO: A Deep Network for Flexible Combination of Episodic Memories
Preprint, 2020
DOI: 10.48550/arxiv.2001.10913

Abstract: Recent research developing neural network architectures with external memory has often used the benchmark bAbI question and answering dataset, which provides a challenging number of tasks requiring reasoning. Here we employed a classic associative inference task from the memory-based reasoning neuroscience literature in order to more carefully probe the reasoning capacity of existing memory-augmented architectures. This task is thought to capture the essence of reasoning: the appreciation of distant relationships…
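For concreteness, here is a minimal sketch of the kind of associative inference trial referred to above, assuming the standard paired-associate setup (study A-B and B-C pairs, then probe the indirect A-C link); the item names and trial construction are illustrative assumptions, not the paper's actual dataset.

```python
# Illustrative only: build one paired-associate inference trial.
# Assumes the standard A-B / B-C study structure; item names are made up.
import random

def make_trial(items, rng=random):
    a, b, c = rng.sample(items, 3)
    study = [(a, b), (b, c)]                       # directly studied pairs
    distractors = rng.sample([x for x in items if x not in (a, b, c)], 2)
    choices = [c] + distractors
    rng.shuffle(choices)
    return study, (a, choices, c)                  # answering requires linking a-b and b-c

items = ["apple", "chair", "river", "lamp", "stone", "cloud"]
study, (cue, choices, answer) = make_trial(items)
print("study:", study, "| cue:", cue, "| choices:", choices)
```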

Cited by 8 publications (10 citation statements)
References 25 publications
“…Transformers have been shown to outperform RNNs in many tasks in both NLP and computer vision. In particular, their ability to directly access historical states and to learn complex interactions among them has been shown to excel in tasks that require complex long-term temporal dependencies, such as memory-based reasoning (Ritter et al., 2020; Banino et al., 2020). Furthermore, they have been shown to be effective for temporal generation in both language and visual domains.…”
Section: TransDreamer (mentioning; confidence: 99%)
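To make the "direct access to historical states" concrete, below is a minimal single-head dot-product attention read over a buffer of stored states; the function and shapes are assumptions for illustration, not the cited models.

```python
# Sketch only: one attention read over all T stored states, so any past
# step is reachable in a single hop (unlike an RNN's compressed state).
import numpy as np

def softmax(x):
    x = x - x.max(axis=-1, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=-1, keepdims=True)

def attend(query, past_states):
    """query: (d,), past_states: (T, d) -> (d,) weighted readout."""
    scores = past_states @ query / np.sqrt(query.shape[-1])  # similarity to each stored state
    return softmax(scores) @ past_states                     # convex combination of history

history = np.random.default_rng(0).normal(size=(10, 4))      # 10 past states, dim 4
print(attend(history[-1], history).shape)                    # (4,)
```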
“…To deal with partial observability (Kaelbling et al., 1998), the dynamics models in MBRL have been implemented using recurrent neural networks (RNNs) (Hafner et al., 2019; Schrittwieser et al., 2020; Kaiser et al., 2019). However, Transformers (Vaswani et al., 2017; Dai et al., 2019) have been shown to be more effective than RNNs in many domains requiring long-term dependencies and direct access to memory for a form of memory-based reasoning (Ritter et al., 2020; Banino et al., 2020). Also, it has been shown that training complex policy networks based on transformers using only rewards is difficult (Parisotto et al., 2020), so learning a transformer-based world model, where the training signal is more diverse, may facilitate learning.…”
Section: Introduction (mentioning; confidence: 99%)
“…In spirit, methods for program induction tend to be closer to neural networks than to symbolic computing. For instance, architectures such as the Neural Turing Machine [197,198], the Differentiable Neural Computer [198,199], the Neural Programmer [200], Neural Programmer-Interpreters [201,195], Neural Program Lattices [202], the Neural State Machine [200], and most recently MEMO [203] extend neural networks with external memory and can infer simple algorithms such as adding numbers, copying, sorting and path finding. [165] illustrates the use of DeepProbLog to solve three program induction tasks and compares their results to Differentiable Forth (∂4) [204].…”
Section: Program Synthesis (mentioning; confidence: 99%)
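As background for the external-memory architectures listed in the quote, the following is a simplified content-based read (cosine similarity sharpened by a softmax over slots) of the sort used, in much richer form, by NTM/DNC-style memories; the names and sharpness parameter are illustrative assumptions.

```python
# Simplified content-addressed read over an external memory matrix.
import numpy as np

def content_read(memory, key, beta=5.0):
    """memory: (N, W) slots, key: (W,) read key, beta: focus sharpness."""
    sims = memory @ key / (np.linalg.norm(memory, axis=1) * np.linalg.norm(key) + 1e-8)
    w = np.exp(beta * sims)
    w /= w.sum()                      # soft read weights over the N slots
    return w @ memory                 # (W,) read vector

memory = np.eye(4) * 2.0              # toy memory: 4 slots of width 4
print(content_read(memory, np.array([0.0, 1.0, 0.0, 0.0])))
```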
“…For example, Lample et al. (2019) proposed to solve the under-fitting problem of the Transformer by introducing a product-key layer that is similar to a memory module. Banino et al. (2020) proposed MEMO, an adaptive memory to reason over long-distance texts. Compared to these studies, the approach proposed in this paper focuses on leveraging memory for decoding rather than encoding, and presents a relational memory that learns from previous generation processes as well as patterns for long text generation.…”
Section: Related Work (mentioning; confidence: 99%)
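For orientation, here is a generic top-k key-value memory lookup of the kind such decoding-time memories build on; this is a hedged sketch under assumed shapes, not Lample et al.'s product-key layer nor MEMO's retrieval mechanism.

```python
# Generic sketch: retrieve the top_k closest keys and blend their values.
import numpy as np

def kv_read(query, keys, values, top_k=2):
    """query: (d,), keys: (N, d), values: (N, v) -> (v,) blended value."""
    scores = keys @ query
    idx = np.argsort(scores)[-top_k:]            # indices of the best-matching keys
    w = np.exp(scores[idx] - scores[idx].max())
    w /= w.sum()
    return w @ values[idx]

rng = np.random.default_rng(1)
keys, values = rng.normal(size=(8, 4)), rng.normal(size=(8, 3))
print(kv_read(np.ones(4), keys, values).shape)   # (3,)
```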