2012
DOI: 10.1007/978-3-642-33269-2_70
Adaptive Learning of Linguistic Hierarchy in a Multiple Timescale Recurrent Neural Network

Abstract: Recent research has revealed that hierarchical linguistic structures can emerge in a recurrent neural network with a sufficient number of delayed context layers. As a representative of this type of network, the Multiple Timescale Recurrent Neural Network (MTRNN) has been proposed for recognising and generating known as well as unknown linguistic utterances. However, the training of utterances performed in other approaches demands a high training effort. In this paper we propose a robust mechanism for a…

Cited by 9 publications (7 citation statements)
References 9 publications
“…Therefore, our knowledge of the hierarchical nature of linguistic structures and the theory of linguistic compositionality have been shown to be biologically plausible. Previous works have applied this hierarchical structure to RNNs in movement tracking (Paine and Tani, 2004), sensorimotor control systems (Yamashita and Tani, 2008) and speech recognition (Heinrich et al, 2012). Based on the above conclusions, we adopt the multiple timescales concept to implement the temporal hierarchy architecture for representing multiple compositionalities which will help in handling longer sequences for our CLM.…”
Section: Related Work
confidence: 95%
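The multiple-timescale mechanism this excerpt refers to is usually realised with leaky-integrator units whose time constants differ per layer. A minimal NumPy sketch (toy sizes, random weights, and made-up time constants for illustration, not the network from the paper):

```python
import numpy as np

def mtrnn_step(u, y, W, tau, b):
    """One leaky-integrator update: units with a large time constant tau
    change slowly (slow context), while units with a small tau track
    their input closely (fast context)."""
    u_new = (1.0 - 1.0 / tau) * u + (1.0 / tau) * (W @ y + b)
    return u_new, np.tanh(u_new)

# Toy network: 4 fast units (tau = 2) and 2 slow units (tau = 16).
rng = np.random.default_rng(0)
n = 6
tau = np.array([2.0] * 4 + [16.0] * 2)
W = rng.normal(scale=0.5, size=(n, n))
b = rng.normal(scale=0.5, size=n)
u = np.zeros(n)
y = np.zeros(n)
for _ in range(10):
    u, y = mtrnn_step(u, y, W, tau, b)
```

Stacking layers with increasing tau yields the temporal hierarchy the cited works exploit: fast layers capture short units such as phonemes or characters, slow layers capture words and phrases.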
“…In order to improve the performance of the CLMs, there is a need for better representation of the additional levels of compositionality and the richer discourse structure found in CLMs. Heinrich et al (2012) used multiple timescale RNNs to learn the linguistic hierarchy for speech related tasks and Ding et al (2016) demonstrated that, during listening to connected speech, cortical activity of different timescales concurrently tracked the time course of abstract linguistic compositionality at different hierarchical levels, such as words, phrases and sentences. In this work, we propose a character-level recurrent neural network (RNN) LM that employs an adaptive multiple timescales approach to incorporate temporal hierarchies in the architecture to enhance the representation of multiple compositionalities.…”
Section: Introduction
confidence: 99%
“…These self-organised values are then transferred to the EC layer and associated with the present embodied perception (EI layer). For training the MTRNN we use an adaptive variant of the real-time backpropagation through time (RTBPTT) algorithm [22,45].…”
Section: Learning
confidence: 99%
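The backward pass underlying the backpropagation-through-time training mentioned above can be sketched for a tiny tanh RNN (plain BPTT with a hypothetical squared loss; the adaptive real-time variant the excerpt cites builds on the same backward accumulation):

```python
import numpy as np

def bptt_grad(W, x, y_target):
    """Loss and weight gradient for h[t+1] = tanh(W h[t] + x[t]),
    accumulated backwards through time (plain BPTT)."""
    T, n = x.shape
    h = np.zeros((T + 1, n))
    for t in range(T):                      # forward pass, storing states
        h[t + 1] = np.tanh(W @ h[t] + x[t])
    loss = 0.5 * np.sum((h[1:] - y_target) ** 2)
    dW = np.zeros_like(W)
    dh = np.zeros(n)                        # gradient flowing from the future
    for t in reversed(range(T)):            # backward pass through time
        dh = dh + (h[t + 1] - y_target[t])  # add this step's loss gradient
        du = dh * (1.0 - h[t + 1] ** 2)     # derivative of tanh
        dW += np.outer(du, h[t])
        dh = W.T @ du                       # propagate to the previous step
    return loss, dW

rng = np.random.default_rng(1)
W = rng.normal(scale=0.3, size=(3, 3))
x = rng.normal(size=(5, 3))
y_target = rng.normal(size=(5, 3))
loss, dW = bptt_grad(W, x, y_target)
```

The sizes, loss, and targets here are made up; the point is only the backward accumulation of `dh` across time steps.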
“…Compared to the previous ICANN contribution [23], the results are based on twice the number of runs per setup and per experiment. The parameters of the network and the metaparameters were mostly chosen based on the experience in [22] and [25] and are detailed in Tab 1. The number of neurons in the input layers |I IO | and |I EC | are given by the input representations.…”
Section: Evaluation and Analysis
confidence: 99%
“…These self-organised values are then transferred to the EC layer and associated with the present embodied perception. For training we use an adaptive mechanism based on the resilient propagation algorithm [8]. During testing, the system approximates EC values from the visual perception input that are transferred to the Csc units, which in turn initiate the generation of a corresponding verbal utterance.…”
Section: Extended MTRNN Model
confidence: 99%
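The resilient propagation (Rprop) scheme the excerpt above refers to adapts a per-weight step size from gradient signs alone. A minimal iRprop−-style sketch (made-up constants and a toy objective, not the authors' exact adaptive mechanism):

```python
import numpy as np

def rprop_update(grad, prev_grad, step, w,
                 eta_plus=1.2, eta_minus=0.5,
                 step_min=1e-6, step_max=1.0):
    """One iRprop- update: grow each weight's step while its gradient
    sign is stable, shrink it on a sign flip, then move each weight
    opposite to its gradient sign (the magnitude is ignored)."""
    sign_change = grad * prev_grad
    step = np.where(sign_change > 0, np.minimum(step * eta_plus, step_max), step)
    step = np.where(sign_change < 0, np.maximum(step * eta_minus, step_min), step)
    grad = np.where(sign_change < 0, 0.0, grad)  # iRprop-: pause after a flip
    w = w - np.sign(grad) * step
    return w, grad, step

# Minimise the toy objective f(w) = ||w||^2, whose gradient is 2w.
w = np.array([3.0, -2.0])
step = np.full_like(w, 0.1)
prev_grad = np.zeros_like(w)
for _ in range(200):
    grad = 2.0 * w
    w, prev_grad, step = rprop_update(grad, prev_grad, step, w)
```

Because only gradient signs are used, Rprop-style updates are robust to the badly scaled gradients that deep-in-time recurrent networks produce, which is presumably why an adaptive variant was chosen for the MTRNN training here.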