2018
DOI: 10.1007/978-3-030-01424-7_10
|View full text |Cite
|
Sign up to set email alerts
|

Implementing Neural Turing Machines

Abstract: Neural Turing Machines (NTMs) are an instance of Memory Augmented Neural Networks, a new class of recurrent neural networks which decouple computation from memory by introducing an external memory unit. NTMs have demonstrated superior performance over Long Short-Term Memory Cells in several sequence learning tasks. A number of open source implementations of NTMs exist but are unstable during training and/or fail to replicate the reported performance of NTMs. This paper presents the details of our successful im… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
40
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 47 publications
(41 citation statements)
references
References 11 publications
1
40
0
Order By: Relevance
“…Several tools can be applied in the implementations of NTM [83][84][85][86][87]; however, the evaluation parameters are independent of the tools. For the entire performed experiments, the RMSProp algorithm has been used for training with a momentum of 0.9.…”
Section: Tools Setup and Parametersmentioning
confidence: 99%
“…Several tools can be applied in the implementations of NTM [83][84][85][86][87]; however, the evaluation parameters are independent of the tools. For the entire performed experiments, the RMSProp algorithm has been used for training with a momentum of 0.9.…”
Section: Tools Setup and Parametersmentioning
confidence: 99%
“…Neural Turing Machine (NTM) explores the concept of evidently extending the context accumulator of RNN with an addressable external memory. They are an example of Memory Augmented Neural Networks, which decouple the computation from memory [13]. NTM have been shown to outperform LSTMs on sequence learning tasks demanding large memory for handling memorization of more extended contexts.…”
Section: Ntmmentioning
confidence: 99%
“…The interaction between the Controller and Memory matrix is carried out by reading and write heads. The memory matrix is initialized using schemes like Constant initialization or Truncated Normal distribution [13]. The NTM model can be trained by variants of stochastic gradients using backpropagation through time in case of an RNN based controller.…”
Section: Ntmmentioning
confidence: 99%
“…This is the case for the Neural Turing Machine [3] for which a neural network is complemented by a flow control mechanism and memory capabilities. Although the enhanced network is differentiable end-to-end, it remains somehow difficult to parameterize and very sensitive to initialization values [4]. Complex neural networks have often proved difficult to train.…”
Section: Introductionmentioning
confidence: 99%